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NOVEL MOLECULES OF THE TNF RECEPTOR SUPERFAMILY 

AND USES THEREFOR 

Background of the Invention 

5 The tumor necrosis factor receptor (TNFR) superfamily of proteins encompasses 

over a dozen members, most of which are type I transmembrane proteins, related by the 
presence of conserved cysteine-rich repeats (CRRs) in their N-terminai cysteine-rich 
domains (CRDs). Members of the TNFR superfamily include TNFR1 (p55). TNFR2 
(p75) r TNFR3 (TNF-RP). Fas (also known as CD95 and Apol). OX-40 ? 41-BB. CD40. 

10 CD30. CD27. OPG. and p75 NGFR. (Smith et aL (1993) Cell 76:959-962; Armitage. 
R.J. (1994) Curr. Opin. Immunol. 6:407-413: Gruss et aL (1995) Blood 85, 3378-3404; 
Baker et aL (1996) Oncogene 12:1-9; and Simonet et aL (1997) Cell 89:309-3 19.) A 
TNFR superfamily member is typically a membrane-bound, trimeric or multimeric 
complex which is stabilized via intracysteine disulfide bonds that are formed between 

1 5 the cysteine-rich domains of individual subunit members (Banner et aL ( 1 993) Cell 

73:43 1-445). The proteins themselves do not have intrinsic catalytic activity, rather they 
function via association with other proteins to transduce cellular signals. 

Most members of the TNFR superfamily recognize ligands that play critical roles 
as costimulators in immune responses. However, a subset of TNFR superfamily 

20 members have been determined to play a key role in the extracellular regulation of cell 
death. Induction of cell death requires a unique cytoplasmic motif which was originally 
identified in TNFR1 and Fas and termed the "death domain" (Tartaglia et aL (1993) Cell 
74:845-853 and Itoh and Nagata ( 1 993) J. Biol Chem. 268:10932-10937). Using the 
yeast two-hybrid method to clone genes encoding proteins that associate with the 

25 cytoplasmic domains of TNFR 1 or Fas. three dramatically different genes were 

identified (TRADD in Hsu et aL (1995) Cell 81 :495-504: FADD in Chinnaiyan et aL 
(1995) Cell 81:501-512; and RIP in Stanger et aL (1995) Cell 81 :5 12-523). FADD was 
independently cloned with the same strategy, and termed MORT1 (Boldin et aL (1995) 
J. Biol. Chem. 270:7795-7798.) In fact, the only structural similarity between these 

30 proteins was the shared motif that has homology with the death domains of the TNFR1 
and Fas receptors. Death domains have recently been identified in a variety of proteins 
including, for example, the ankyrins. the Drosophila proteins PELLE and TUBE. DAP 
kinase, mouse myD88. (For review see Feinstein and Kimchi (1995) Trends. Biochem. 
Sci. 20:342-344; Golstein et aL (1995) Cell 81:185-186; Cleveland and Ihle (1995) Cell 

35 8 1:479-482; and Hofman and TschoppO 995) FEBS Lett. 371:321-323). Moreover, the 
death domain has been implicated in protein: protein interactions between two proteins 
each containing such a domain. Such a death domaimdeath domain interaction is 
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believed to be a crucial component of the cellular signal transduction pathways that lead 
to cell death, thus, implicating members of the TNFR superfamily in a wide range of 
signal transduction with appreciably diverse outcomes. 

Aside from the membrane-bound forms of TNFR superfamily proteins that 
function as cellular signal transducers, a functional TNFR superfamily protein can also 
exist in a soluble form. Soluble versions of the superfamily bind cognate ligands and 
influence bioavailability. For instance, the osteoprotegerin protein family exists as a 
soluble protein (Simonet et al. (1997) Cell 89:309-319). Many soluble forms of the 
TNFR have been identified. Certain soluble TNFRs are elevated in disease states such 
as lupus and rheumatoid arthritis (Gabay et al. (1997) J. Rheumatol. 24(2):303-308). 
The soluble superfamily members lack the transmembrane domain characteristic of the 
majority of superfamily members due to either proteolytic cleavage or, at least in one 
instance, to alternative splicing (Gruss et al. (1995) Blood 85. 3378-3404). 

Given the important role of proteins of the TNFR superfamily. including both 
soluble as well as membrane-bound family members, in a wide range of cellular signal 
transduction pathways, there exists a need for identifying novel members of the TNFR 
superfamily as well as for modulators of such molecules for use in regulating a variety of 
cellular responses. 

Summary of the Invention 

The present invention is based, at least in part, on the discovery of novel 
molecules of the TNF receptor superfamily. referred to herein as TNF receptor-like 
M TRL" nucleic acid and protein molecules. The TRL molecules of the present invention 
are useful as modulating agents in regulating a variety of cellular processes. 
Accordingly, in one aspect, this invention provides isolated nucleic acid molecules 
encoding TRL proteins or biologically active portions thereof, as well as nucleic acid 
fragments suitable as primers or hybridization probes for the detection of TRL-encoding 
nucleic acids. In one embodiment, an isolated nucleic acid molecule of the present 
invention preferably encodes a TRL protein which includes a cysteine-rich domain, a C- 
terminal unique domain and is membrane bound or secreted. In another embodiment, 
the nucleic acid molecule is a naturally occurring nucleotide sequence. 

In another embodiment, a nucleic acid molecule of the invention is 60% 
homologous to the nucleotide sequence shown in SEQ ID NO:l. SEQ ID NO:3. SEQ ID 
NO:22, the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC 
as Accession Number 98649, or a complement thereof and, preferably, encodes a TRL 
protein. In yet another embodiement, the isolated nucleic acid molecule is 60% 
homologous to the nucleotide sequence shown in SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
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NO:24, or a complement thereof and, preferably, encodes a TRL protein. In a preferred 
embodiment, the isolated nucleic acid molecule encodes the amino acid sequence of 
human or mouse TRL protein. 

In another embodiment, the isolated nucleic acid includes a nucleotide sequence 
5 encoding a protein having an amino acid sequence sufficiently homologous to a 

cysteine-rich domain amino acid sequence of SEQ ID NO:2 ? SEQ ID NO:4, or SEQ ID 
NO:23 and. preferably, encodes a TRL protein. In a preferred embodiment, the nucleic 
acid molecule has the nucleotide sequence shown in SEQ ID NO:5, SEQ ID NO:6, or 
SEQ ID NO:24. In another preferred embodiment, the nucleic acid molecule has the 

10 nucleotide sequence shown in SEQ ID NO:L SEQ ID NO:3. SEQ ID NO:22 ? or the 
nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98649. 

Another embodiment of the invention features isolated nucleic acid molecules 
which specifically detect TRL nucleic acid molecules relative to nucleic acid molecules 

1 5 encoding other TNFR superfamily molecules. For example, in one embodiment, the 
nucleic acid molecule hybridizes under stringent conditions to a nucleic acid molecule 
comprising the nucleotide sequence shown in nucleotides 480 to 1 165 of SEQ ID NO:l 
or nucleotides 455 to 2155 of SEQ ID NO:3. In another embodiment, the nucleic acid 
molecule is at least 500 nucleotides in length. In another embodiment, the nucleic acid 

20 molecule is at least 500 nucleotides in length and hybridizes under stringent conditions 
to a nucleic acid molecule comprising the nucleotide sequence shown in SEQ ID NO:l. 
SEQ ID NO:3, SEQ ID NO:22. the nucleotide sequence of the DNA insert of the 
plasmid deposited with ATCC as Accession Number 98649, or a complement thereof 
In yet another embodiment, the nucleic acid molecule is at least 500 nucleotides in 

25 length and encodes a TRL or portion thereof, preferably a biologically active portion 
thereof 

In a preferred embodiment, an isolated nucleic acid molecule comprises 
nucleotides 344-2065 of SEQ ID NO: 1 or a complement thereof In another 
embodiment, the nucleic acid molecule further comprises nucleotides 1-343 of SEQ ID 

30 NO: 1 . In yet another preferred embodiment, the nucleic acid molecule further comprises 
nucleotides 2066-333 1 of SEQ ID NO: 1 . 

In another preferred embodiment of the invention, an isolated nucleic acid 
molecule comprises nucleotides 190-95 1 of SEQ ID NO:3 or a complement thereof. In 
another embodiment, the nucleic acid molecule further comprises nucleotides 1-189 of 

35 SEQ ID NO:3. In yet another preferred embodiment, the nucleic acid molecule further 
comprises nucleotides 952-2612 of SEQ ID NO:3. 
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In another preferred embodiment of the invention, an isolated nucleic acid 
molecule comprises nucleotides 510-2324 of SEQ ID NO:22 or a complement thereof. 
In another embodiment, the nucleic acid molecule further comprises nucleotides 1-509 of 
SEQ ID NO:22. In yet another preferred embodiment, the nucleic acid molecule further 
comprises nucleotides 2325-2638 of SEQ ID NO:22. 

Another embodiment the invention provides an isolated nucleic acid molecule 
which is antisense to the coding strand of a TRL nucleic acid. 

Another aspect of the invention provides a vector comprising a nucleic acid 
molecule of the invention, preferably a TRL nucleic acid molecule. In certain 
embodiments, the vector is a recombinant expression vector. In another embodiment the 
invention provides a host cell containing a vector of the invention. The invention also 
provides a method for producing TRL protein by culturing in a suitable medium, a host 
cell of the invention containing a recombinant expression vector such that TRL protein is 
produced. 

Another aspect of this invention features isolated or recombinant proteins and 
polypeptides, preferably TRL proteins and polypeptides. In one embodiment, the 
isolated protein, preferably a TRL protein, has a cysteine-rich domain, a C-terminal 
unique domain and is membrane bound or secreted. In another embodiment, an isolated 
protein, preferably a TRL protein, has an amino acid sequence sufficiently homologous 
to a cysteine-rich domain amino acid sequence of SEQ ID NO:2, SEQ ID NO:4. or SEQ 
ID NO:23. In another embodiment, the invention features fragments of the proteins 
having the amino acid sequence of SEQ ID NO:2. SEQ ID NO:4, or SEQ ID NO:23. 
wherein the fragment comprises at least 1 5 contiguous amino acids of the amino acid 
sequence of SEQ ID NO:2. SEQ ID NO:4. SEQ ID NO:23, or an amino acid sequence 
encoded by the DNA insert of the plasmid deposited with the ATCC as Accession No. 
98649. In a preferred embodiment, the protein has the amino acid sequence of SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:23. 

Another embodiment of the invention features isolated proteins, preferably TRL 
proteins, having an amino acid sequence at least about 60% homologous to the amino 
acid sequence of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:23. Yet another 
embodiment of the invention features isolated protein, preferably TRL proteins, which 
are encoded by nucleic acid molecules having a nucleotide sequence at least about 60% 
homologous to a nucleotide sequence of SEQ ID NO:5. SEQ ID NO:6, SEQ ID NO:l, 
SEQ ID NO:3, SEQ ID NO:22, the nucleotide sequence of the DNA insert of the 
plasmid deposited with ATCC as Accession Number 98649, or a complement thereof. 
This invention further features isolated proteins, preferably TRL proteins, which are 
encoded by a nucleic acid molecules having a nucleotide sequence which hybridizes 
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under stringent hybridization conditions to a nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO:5 r SEQ ID NO:6. SEQ ID NO: 1 . SEQ ID NO:3 r 
SEQ ID NO:22. or the nucleotide sequence of the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98649. 

The proteins of the present invention, preferably TRL proteins, or portions 
thereof, (e.g.. biologically active portions thereof) can be operatively linked to a non- 
TRL polypeptide to form fusion proteins, preferably TRL fusion proteins. The invention 
further features antibodies that specifically bind TRL proteins, such as monoclonal or 
polyclonal antibodies. In addition, the proteins of the invention or biologically active 
portions thereof can be incorporated into pharmaceutical compositions, which optionally 
include pharmaceutically acceptable carriers. 

In another aspect, the present invention provides a method for detecting the 
presence of TRL activity or expression in a biological sample by contacting the 
biological sample with an agent capable of detecting an indicator of TRL activity such 
that the presence of TRL activity is detected in the biological sample. 

In another aspect, the invention provides a method for modulating TRL activity 
comprising contacting a cell capable of expressing TRL with an agent that modulates . 
TRL activity such that TRL activity in the cell is modulated. In one embodiment, the ■ 
agent inhibits TRL activity. In another embodiment, the agent stimulates TRL activity. 
In one embodiment, the agent is an antibody that specifically binds to TRL protein. In 
another embodiment, the agent modulates expression of TRL by modulating 
transcription of a TRL gene or translation of a TRL mRNA. In yet another embodiment, 
the agent is a nucleic acid molecule having a nucleotide sequence that is antisense to the 
coding strand of the TRL mRNA or the TRL gene. 

In one embodiment, the methods of the present invention are used to treat a 
subject having a disorder characterized by aberrant TRL protein or nucleic acid 
expression or activity by administering an agent which is a TRL modulator to the 
subject. In one embodiment, the TRL modulator is a TRL protein. In another 
embodiment the TRL modulator is a TRL nucleic acid molecule. In yet another 
embodiemenL the TRL modulator is a peptide, peptidomimetic, or other small molecule. 
In a preferred embodiment, the disorder characterized by aberrant TRL protein or nucleic 
acid expression is a proliferative (e.g. T cancer, for example, pancreatic cancer), a 
differentiative disorder, diabetes of insulin resistance. 

The present invention also provides a diagnostic assay for identifying the 
presence or absence of a genetic lesion characterized by at least one of (i) aberrant 
modification or mutation of a gene encoding a TRL protein; fii) mis-regulation of said 
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gene; and (iii) aberrant post-translational modification of a TRL protein, wherein a wild- 
type form of said gene encodes an protein with a TRL activity. 

In another aspect, the invention provides a method for identifying a compound 
that binds to or modulates the activity of a TRL protein, by providing a indicator 
composition comprising a TRL protein having TRL activity, contacting the indicator 
composition with a test compound, and determining the effect of the test compound on 
TRL activity in the indicator composition to identify a compound that modulates the 
activity of a TRL protein. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence and predicted amino acid sequence of 
murine TRL. The nucleotide sequence corresponds to nucleic acids 1 to 3331 of SEQ ID 
NO: 1 . The amino acid sequence corresponds to amino acids 1 to 573 of SEQ ID NO:2. 

Figure 2 depicts the cDNA sequence and predicted amino acid sequence of 
human TRL L The nucleotide sequence corresponds to nucleic acids 1 to 2612 of SEQ 
ID NO:3. The amino acid sequence corresponds to amino acids 1 to 273 of SEQ ID 
NO:4. 

Figure 3 depicts an alignment of the amino acid sequences of human TRL I 
(corresponding to amino acids 1 to 273 of SEQ ID NO:4), human TRL II (corresponding 
to amino acids 1 to 605 of SEQ ID NO:23). murine TRL (corresponding to amino acids 
1 to 573 of SEQ ID NO:2, human TNFR2 precursor (Swiss-Prot™ Accession No. 
P20333) (corresponding to amino acids 1 to 461 of SEQ ID NO:7). murine TNFR2 
precursor (Swiss-Prot™ Accession No. P251 19) (corresponding to amino acids 1 to 474 
of SEQ ID NO:8), soluble TNFR2 precursor (Swiss-Prot™ Accession No. P25943) 
(corresponding to amino acids 1 to 325 of SEQ ID NO:9), human CD40 receptor 
precursor (Swiss-Prot™ Accession No. P25942) (corresponding to amino acids 1 to 277 
of SEQ ID NO: 10), murine CD40 receptor precursor (Swiss-Prot™ Accession No. 
P27512) (corresponding to amino acids 1 to 289 of SEQ ID NO:l 1) ; human 
osteoprotegerin (Swiss-Prot™ Accession No. U94332) (corresponding to amino acids 1 
to 401 of SEQ ID NO: 12), and murine osteoprotegerin (Swiss-Prot™ Accession No. 
U94331) (corresponding to amino acids 1 to 401 of SEQ ID NO: 13). A signal sequence 
is designated by bold characters. Cysteine-rich domains are designated alternatively by 
italicized characters or by underlined, italicized characters. A transmembrane domain is 
indicated by bold, underlined characters. The alignment was performed using the 
Clustal algorithm which is part of the MEGALIGN program (e.g., version 3.1.7) which 
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is part of the DNASTAR sequence analysis software package. The pairwise alignment 
parameters are as follows: K-tuple = 1; Gap Penalty = 3; Window = 5; Diagonals saved 
= 5 . The multiple alignment parameters are as follows: Gap Penalty = 10; and Gap 
length penalty = 1 0. 

5 Figure 4 depicts the cDNA sequence and predicted amino acid sequence of 

human TRL II. The nucleotide sequence corresponds to nucleic acids 1 to 2638 of SEQ 
ID NO:22. The amino acid sequence corresponds to amino acids 1 to 605 of SEQ ID 
NO:23. 

Figure 5 depicts an alignment of the amino acid sequences of human TRL II 

10 (corresponding to amino acids 1 to 605 of SEQ ID NO:23) ? human TRL I (corresponding 
to amino acids 1 to 273 of SEQ ID NO:4). and murine TRL (corresponding to amino 
acids 1 to 573 of SEQ ID NO:2. 

Figure 6 depicts an alignment of the amino acid sequences of the human TNFR1 
(Swiss-Prot™ Accession No.P 19438) death domain (corresponding to amino acids 356- 

15 441 of human TNFR1 or. alternatively, corresponding to amino acids 1-86 of SEQ ID 
NO:25) ? the human Fas (Swiss-Prot™ Accession No. P25445) death domain 
(corresponding to amino acids 230-3 14 of human Fas or ? alternatively, corresponding to 
amino acids 1-85 of SEQ ID NO:26). the human TRADD (GenBank™ Accession No. - 
L41690) death domain (corresponding to amino acids 231-321 human TRADD or. 

20 alternatively, corresponding to amino acids 1-91 of SEQ ID NO:27), the human FADD 
(EMBL™ Accession No. X84709) death domain (corresponding to amino acids 97- 181 
of human FADD or, alternatively, corresponding to amino acids 1-85 of SEQ ID 7 
NO:28). the human RIP (Swiss-Prot™ Accession No. U25994) death domain 
(corresponding to amino acids 284-370 of human RIP or. alternatively, corresponding to 

25 amino acids 1 -87 of SEQ ID NO:29). the human TRL II death domain (corresponding to 
amino acids 4 1 5-498 of SEQ ID NO:23 or alternatively, amino acids 1 -84 of SEQ ID 
NO:30), and the mouse TRL death domain (corresponding to amino acids 333-416 of 
SEQ ID NO:2 or alternatively, amino acids 1-84 of SEQ ID NO:31). Identical or similar 
amino acids are indicated in bold. Highly conserved amino acids are indicated by 

30 astrices. The alignment was performed as in Figure 3. 

Detailed Description of the Invention 

The present invention is based on the discovery of novel molecules of the TNF 
receptor superfamily, referred to herein as TRL protein and nucleic acid molecules. 
35 which comprise a family of molecules having certain conserved structural and functional 
features. The term "family" when referring to the protein and nucleic acid molecules of 
the invention is intended to mean two or more proteins or nucleic acid molecules having 
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a common structural domain or motif and having sufficient amino acid or nucleotide 
sequence homology as defined herein. Such family members can be naturally occurring 
and can be from either the same or different species. For example, a family can contain 
a first protein of human origin and a homologue of that protein of murine origin, as well 
5 as a second, distinct protein of human origin and a murine homologue of that protein. 
Members of a family may also have common functional characteristics. 

In one embodiment, an isolated protein of the invention, preferably a TRL 
protein, is identified based on the presence of at least one "cysteine-rich domain" in the 
protein or corresponding nucleic acid molecule. As used herein, a "cysteine-rich 

10 domain" includes a protein domain having an amino acid sequence of at least about 20 
amino acid residues of which about 2 amino acids are the amino acid residue cysteine. 
In a preferrred embodiment, a cysteine-rich domain includes at least about 30. more 
preferably at least about 35-40 amino acid residues, of which at least about about 3. more 
preferably at least about 4. 5 or 6 amino acids are the amino acid residue cysteine. 

15 Cysteine-rich domains having lengths of 45-50 or 60 amino acid residues and having up 
to 7, 8, 9 or 10 cysteine residues are also within the scope of this invention. In one 
embodiment, an isolated protein, preferably a TRL protein, includes a cysteine rich 
domain having at least about 20%, preferably at least about 30%. and more preferably 
about 40% amino acid sequence homology to a TNFR cysteine-rich domain (also 

20 referred to as a cysteine-rich repeat or "CRR"), such as the TNFR cysteine-rich domain 
of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:23 (e.g., amino acid residues 39-77, 78- 
1 19, 120-164. and 165-203 of murine TNFR2 or amino acid residues 39-76, 77-1 18, 
1 19-162, and 163-201 of human TNFR2. respectively). Preferably, the isolated protein 
includes at least two cysteine-rich domains, more preferably at least three cysteine-rich 

25 domains, and more preferably at least four or five cysteine-rich domains. For example, 
in one embodiment, the isolated protein is a TRL protein which contains two or more 
cysteine-rich domains and has at least about 20%. more preferably about 30%, and even 
more preferably about 40% homology to a TNFR cysteine-rich domain of SEQ ID NO:2, 
SEQ ID NO:4. or SEQ ID NO:23 (e.g., amino acid residues 39-203 of murine TNFR2 or 

30 amino acid residues 39-201 of human TNFR2, respectively). Cysteine rich domains are 
described in. for example, Lodish H. et al. Molecular Cell Biology, (Scientific American 
. Books Inc., New York, N.Y.. 1995), the contents of which are incorporated herein by 
reference. 
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Preferred molecules of the present invention have an amino acid sequence 
sufficiently homologous to a cysteine rich domain amino acid sequence of SEQ ID 
NO:2 ; SEQ ID NO:4 ; or SEQ ID NO:23 or a nucleotide sequence sufficiently 
homologous to a nucleotide sequence encoding a cysteine rich domain of SEQ ID NO:2 ? 
5 SEQ ID NO:4, or SEQ ID NO:23. As used herein, the term "sufficiently homologous" 
refers to a first amino acid or nucleotide sequence which contains a sufficient or 
minimum number of identical or equivalent (e.g. ; an amino acid residue which has a 
similar side chain) amino acid residues or nucleotides to a second amino acid or 
nucleotide sequence such that the first and second amino acid or nucleotide sequences 

1 0 share common structural domains or motifs and/or a common functional activity. For 
example, amino acid or nucleotide sequences which share common structural domains . 
have at least about 50% homology, preferably 60% homology, more preferably 70%- 
80%. and even more preferably 90-95% homology across the amino acid sequences of 
the domains and contain at least one and preferably two structural domains or motifs, are 

1 5 defined herein as sufficiently homologous. Furthermore, amino acid or nucleotide . - 
sequences which share at least 50%. preferably 60%. more preferably 70-80%. or 90- 
95% homology and share a common functional activity are defined herein as sufficiently 
homologous. In one embodiment, the a TRL protein contains a cysteine-rich domain and 
has a TRL activity. 

20 As used interchangeably herein a 'TRL activity 5 ', "biological activity of TRL ?? or 

"functional activity of TRL'\ refers to an activity exerted by a TRL protein, polypeptide 
or nucleic acid molecule on a TRL responsive cell as determined in vivo, or in vitro.' ^ 
according to standard techniques. In one embodiment, a TRL activity is a direct activity, 
such as an association with or an enzymatic activity on a second protein. In another 

25 embodiment, a TRL activity is an indirect activity, such as a cellular signaling activity 
mediated by interaction of the TRL protein with a second protein. In a preferred 
embodiment, a TRL activity is at least one or more of the following activities: (i) 
formation of a homogeneous multimeric signaling complex with like TRL proteins; (ii) 
formation of a heterogeneous multimeric signaling complex with other TNFR 

30 superfamily proteins or other cell-surface proteins; (iii) complex formation between a 

membrane-bound TRL protein and a cytokine; (iv) complex formation between a soluble 
TRL protein and a cognate ligand; (v) interaction of a TRL protein with an intracellular 
protein having substantial homology to the TNFR-associated proteins; (vi) interaction of 
a TRL protein with a TNFR-associated protein; and (vii) interaction of a TRL protein 

35 with other cellular proteins including cytoplasmic proteins (e.g. SH2 domain-containing 
proteins or a second death domain-containing protein) or cytoskeletal proteins. In yet 
another preferred embodiment, a TRL activity is at least one or more of the following 



BNSOOCIO:<WO 9915663A1> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT/US98/20219 



- 10 - 

activities: (i) modulation of cellular signal transduction; (ii) regulation of cellular 
proliferation; (iii) regulation of cellular differentiation; (iv) regulation of cell survival or 
apoptosis; (v) modulation of a cell involved in the immune response; (vi) modulation of 
a cell involved in insulin resistance; and (vti) modulation of a diabetic response. 
5 Accordingly, another embodiment of the invention features isolated TRL proteins 

and polypeptides having a TRL activity. Preferred proteins are TRL proteins having at 
least one cysteine-rich domain (and preferably two or more cysteine-rich domains) and, 
preferably, a TRL activity. In another preferred embodiment, the isolated protein, 
preferably a TRL protein, has at least one cysteine-rich domain (and preferably two or 
10 more cysteine-rich domains), a TRL activity and an amino acid sequence sufficiently 
homologous to an amino acid sequence of SEQ ID NO:2. SEQ ID NO:4, or SEQ ID 
NO:23. 

Another embodiment of the invention features molecules, preferably TRL 
molecules, which contain a C-terminal unique domain. The term "C-terminal unique 

15 domain" as used herein, includes a protein domain of a TRL protein family member (or 
TNFR protein superfamily member) which includes amino acid residues C-terminal to 
the C-terminus of a cysteine-rich domain in the amino acid sequence of the TRL protein 
(or TNFR protein superfamily member), e.g., a protein domain which includes amino 
acid residues from the C-terminus of the cysteine-rich domain to the C-terminal amino 

20 acid residue of the amino acid sequence of the protein. In a preferred embodiment, a C- 
terminal unique domain includes at least about 370-460 amino acid residues. In another 
embodiment, a C-terminal unique domain includes at least about 380-450 amino acid 
residues, preferably about 390-440. more preferably about 400-430. and even more 
preferably about 410-420 amino acid residues. A C-terminal unique domain is 

25 sufficiently homologous between TRL protein family members such that the domain is 
at least about 40%, preferably about 50%. more preferably about 60%. even more 
preferably about 70%, 80%, or 90% homologous to the C-terminal unique domain of 
human TRL II (e.g., amino acid residues 213-605 of human TRL II) or mouse TRL (e.g., 
amino acid residues 131-573 of mouse TRL). As defined herein, a C-terminal unique 

30 domain of a TRL protein family member, however, is not sufficiently homologous to a 
C-terminal unique domain of a member of another protein family, such as a TNFR 
protein family. 

A C-terminal unique domain of a TRL protein family member can further 
comprise a "death domain". As used herein, a death domain comprises about 50-100, 
35 amino acids residues and is localized near the C-terminal end of a naturally-occurring, 
death-domain-containing protein. In a preferred embodiment, a death domain comprises 
about 60-90, preferably 70-80 amino acids residues and is localized near the C-terminal 
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end of a naturally-occurring, death-domain-containing protein. For example, the C- 
terminus of a death domain located near the C-terminal end of a protein can be located at 
least about 2-200 amino acid residues from the C-terminus of the protein. Preferably, 
the C-terminus of a death domain is located about 5-190, more preferably about 10-180, 
5 more preferably about 1 5-1 70, even more preferably about 20-160, and even more 
preferably about 25-1 50 amino acid residues from the C-terminus of the protein. 
Sequence homology among death domains is found at both at the C-terminal and N- 
terminal end of the death domain, with the intervening middle region frequently 
containing amino acid insertions or deletions. An alignment of several death domains is 

10 provided in Figure 6. There are several amino acid positions within the domain that are 
highly conserved among death domain-containing family members, in particular, the 
tryptophans and leucine indicated by astrices in Figure 6. In a preferred embodiment, the 
death domain has at least 2, preferably 3 leucines which are conserved. In another 
preferred embodiment, the death domain has at least 1. preferably 2 tryptophans which 

15 are conserved. A death domain is further predicted to have an overall a-helical structure. 
Accordingly, in one embodiment, proteins of the invention, preferably TRL 
proteins, contain at least one cysteine-rich domain (and preferably two or more cysteine- 
rich domains) and have an amino acid sequence sufficiently homologous to a C-terminal 
unique domain amino acid sequence of SEQ ID NO:2. SEQ ID NO:4, or SEQ ID NO:23. 

20 In another preferred embodiment, the isolated protein, preferably a TRL protein, has at 
least one cysteine-rich domain (and preferably two or more cysteine-rich domains), an 
amino acid sequence sufficiently homologous to a C-terminal unique domain amino acid 
sequence of SEQ ID NO:2. SEQ ID NO:4, or SEQ ID NO:23 and a TRL activity. In 
another preferred embodiment, the isolated protein, preferably a TRL protein, has at least 

25 one cysteine-rich domain (and preferably two or more cysteine-rich domains), an amino 
acid sequence sufficiently homologous to a C-terminal unique domain amino acid 
sequence of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:23. wherein the C-terminal 
unique domain contains a death domain. In another preferred embodiment, the isolated 
protein, preferably a TRL protein, has at least one cysteine-rich domain (and preferably 

30 two or more cysteine-rich domains), an amino acid sequence sufficiently homologous to 
a C-terminal unique domain amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, or 
SEQ ID NO:23. wherein the C-terminal unique domain contains a death domain, and has 
a TRL activity. 

In yet another embodiment of the invention, the isolated protein and nucleic acid 
35 molecules, preferably TRL -molecules, contain a transmembrane domain. As used 

herein, a "transmembrane domain" includes a protein domain having at least about 10 
amino acid residues of which about 60% of the amino acid residues contain non-polar 
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side chains, for example, alanine, valine, leucine, isoleucine, proline, phenylalanine, 
tryptophan, and methionine. In a preferred embodiment, a "transmembrane domain" 
includes a protein domain having at least about 13, preferably about 16. more preferably 
about 19, and even more preferably about 21, 23. 25. 30, 35 or 40 amino acid residues. 
5 of which at least about 70%, preferably about 80%, and more preferably about 90% of 
the amino acid residues contain non-polar side chains, for example, alanine, valine, 
leucine, isoleucine, proline, phenylalanine, tryptophan, and methionine. A 
transmembrane domain is lipophillic in nature. For example, a transmembrane domain 
can be found at about amino acids 352-370 of SEQ ID NO:23 (Trp352 to Ile370 of the 

1 0 human TRL II amino acid sequence). 

Yet another embodiment of the invention features TRL molecules which contain 
a signal sequence. As used herein, a "signal sequence" includes a peptide of at least 
about 20 amino acid residues in length which occurs at the N-terminus of secretory and 
integral membrane proteins and which contains at least 55% hydrophobic amino acid 

15 residues. In a preferred embodiment, a signal sequence contains at least about 15*45 
amino acid residues, preferably about 20-42 amino acid residues. Signal sequences of 
25-35 amino acid residues and 28-32 amino acid residues are also within the scope of the 
invention. As used herein, a signal sequence has at least about 40-70%, preferably about 
50-65%, and more preferably about 55-60% hydrophobic amino acid residues (e.g., 

20 Alanine, Valine, Leucine. Isoleucine. Phenylalanine, Tyrosine. Tryptophan, or Proline). 
Such a "signal sequence", also referred to in the art as a "signal peptide", serves to direct 
a protein containing such a sequence to a lipid bilayer. For example, a signal sequence 
can be found about amino acids 1-41 of SEQ ID NO:23 (Metl to Ala41 of the human 
TRL II amino acid sequence.) 

25 In a particularly preferred embodiment, the TRL protein and nucleic acid 

molecules of the present invention are human TRL molecules. A nucleotide sequence of 
the isolated human TRL I cDNA and the predicted amino acid sequence of the human 
TRL I protein are shown in Figure 2 and in SEQ ID NOs:3 and 4. respectively. In 
addition, the nucleotide sequence corresponding to the coding region of the human TRL 

30 I cDNA (nucleotides 190-951) is represented as SEQ ID NO:6. 

A 4.2 kb TRL mRNA transcript is expressed in human tissues including heart, 
brain, placenta, lung, liver, skeletal muscle, kidney, pancreas, spleen, thymus, prostate, 
testis, ovary, small intestine, colon, and peripheral blood lymphocytes, with more 
pronounced expression observed in human kidney, brain, placenta, and colon, as 

35 determined by Northern blot analysis. Chromosomal mapping indicates that the human 
TRL gene is located on chromosome 6p25. 
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In situ analysis indicates that human TRL mRNA is expressed in adult tissues as 
follows: human pancreas: multifocal signal in pancreas consistent with islets as the most 
likely source of signal; human tonsil: strong signal in a pattern suggestive of stratified 
squamous epithelial cell layer (pharyngeal mucosa); human brain (section of cortex): no 
5 signal; human spleen: no signal: human skeletal muscle: no signal: human small 
intestine: signal observed and determined to be the Peyef s patches : human colon: 
expression in surface mucosal epithelium (may be in mucus): human kidney - medulla. 

Moreover, TRL mRNA is expressed in the following manner in adult monkey 
tissues: monkey stomach: strong signal observed in the mucosal layer; monkey pancreas: 
10 multifocal signal observed and determined to be the islets. 

The particular expression of TRL mRNA in the pancreas of adult human and 
monkey tissues (e.g.. multifocal signal in the pancreas sections of both species) indicates 
that the TRL proteins of the present invention play a potential role in pancreatic cancer 
and/or diabetes. 

15 The human TRL I cDNA. which is approximately 2612 nucleotides in length, 

encodes a protein having a molecular weight of approximately 27 kDa (excluding post- 
translational modifications) and which is approximately 253 amino acid residues in 
length. The human TRL I protein contains four cysteine-rich domains. A TRL cysteine 
rich domain can be found at least, for example, from about amino acids 47-89 of SEQ ID 

20 NO:4 (Ala47 to Ser89 of the human TRL I amino acid sequence); from about amino 
acids 90-131 of SEQ ID NO:4 (Ser90 to Cysl31 of the human TRL I amino acid 
sequence); from about amino acids 132-169 of SEQ ID NO:4 (Thrl32 to Lysl69 of the 
human TRL I amino acid sequence): and from about amino acids 170-212 of SEQ ID 
NO:4 (Glnl70 to GIy212 of the human TRL I amino acid sequence.) The human TRL I 

25 protein is a secreted protein which lacks a transmembrane domain, however the N- 

terminal cysteine-rich domain shares significant homology with a membrane bound form 
of TRL. The human TRL I protein further contains a signal sequence at about amino 
acids 1-41 of SEQ ID NO:4 (Metl to Ala41 of the human TRL I amino acid sequence.) 
Alternatively, the signal sequence may contain amino acids 1-42 of SEQ ID NO:4. The 

30 prediction of such a signal peptide can.be made utilizing the computer algorithm 
SIGN ALP (Henrik, et aL (1997) Protein Engineering 10:1-6). 

A nucleotide sequence of the isolated human TRL II cDNA and the predicted 
amino acid sequence of the human TRL II protein are shown in Figure 4 and in SEQ ID 
NOs:23 and 24, respectively. In addition, the nucleotide sequence corresponding to the 

35 coding region of the human TRL II cDNA (nucloetides 5 10-2324) is represented as SEQ 
ID NO:25. 



BNSDOCID:<WO 9915663A1> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT/US98/20219 



The human TRL II cDNA, which is approximately 2638 nucleotides in length, 
encodes a protein having a molecular weight of approximately 66.2 kDa (excluding post- 
radiational modifications) and which is approximately 605 amino acid residues in 
length. A plasmid containing the full length nucleotide sequence encoding human TRL 
5 II (clone designation ephT75L) was deposited under the provisions of the Budapest 
Treaty with the American Type Culture Collection (ATCC), presently in Manassas, 
Virginia, on February 6, 1998 and assigned Accession Number 98649. The human TRL 
II protein contains four cysteine-rich domains. A TRL cysteine rich domain can be 
found at least, for example, from about amino acids 47-89 of SEQ ID NO:23 (Ala47 to 

10 Ser89 of the human TRL II amino acid sequence); from about amino acids 90-131 of 
SEQ ID NO:23 (Ser90 to Cysl31 of the human TRL II amino acid sequence); from 
about amino acids 132-169 of SEQ ID NO:23 (Thrl32 to Lysl69 of the human TRL II 
amino acid sequence): and from about amino acids 170-212 of SEQ ID NO:23 (Glnl70 
to Gly212 of the human TRL II amino acid sequence.) The human TRL II protein 

1 5 contains three potential protein kinase C phosphorylation sites at amino acids 441, 467 
and 506 of SEQ ID NO 23. The human TRL II protein is a membrane-bound protein 
which contains a transmembrane domain at about amino acids 352-370 of SEQ ID 
NO:23 (Trp352 to Ile370 of the human TRL II amino acid sequence). The human TRL 
II protein further contains a signal sequence at about amino acids 1-41 of SEQ ID NO:23 

20 (Metl to Ala41 of the human TRL II amino acid sequence.) Alterantively, the signal 
sequence may contain amino acids 1-42 of SEQ ID NO:4. The prediction of such a 
signal peptide can be made utilizing the computer algorithm SIGNALP (Henrik, et al. 
( 1 997) Protein Engineering 10:1 -6). The C-terminal unique region of human TRL II 
contains a death domain at amino acids 415-498 of SEQ ID NO:23. 

25 In another preferred embodiment, the TRL protein and nucleic acid molecules of 

the present invention are murine TRL molecules. A murine TRL nucleic acid molecule 
was identified from a primary murine megakaryocyte cDNA library (described in further 
detail in Example 1). The nucleotide sequence of the isolated murine TRL cDNA and 
the predicted amino acid sequence of the murine TRL protein are shown in Figure 1 and 

30 in SEQ ID NOs:l and 2, respectively. In addition, the coding region of murine TRL 
cDNA (corresponding to nucleotides 344-2065) is depicted as SEQ ID NO:5. 

A 4.2 kb TRL mRNA transcript is expressed in mouse tissues including heart, 
brain, spleen, lung, liver, skeletal muscle, kidney, and testis, with more pronounced 
expression observed in mouse kidney and brain, as determined by Northern blot analysis. 

35 Chromosomal mapping indicates that the murine TRL gene is located 12cM distal of 
D17MIT48 and 20cM proximal of D17MIT9. This region is syntenic to 6p21 in human. 
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In situ analysis indicates that TRL mRNA is expressed in murine adult tissues as 
follows: brain (cortex): neuronal expression (no glial cell expression); brain 
(cerebellum): purkinje cells (no granule cell expression); brain (hippocampus): at least 
CA1. CA2, and possibly CA3 (no dentate gyrus expression); brainstem; bladder: 
5 epithelial cell layer; transverse colon; descending colon; colon (paraffin): goblet cells; 
kidney: (signal seen in the cortex but not the medulla), proximal convoluted tubules, not 
in glomeruli or medulla; and stomach: glandularregion. 

Moreover, TRL mRNA is expressed in the following manner in murine 
embryonic tissues: brain; liver (signal decreases significantly b/w E14.5 and PI. 5); 
10 trigeminal ganglion; kidney; retina (photoreceptor layer); lung; olfactory 

epithelium;intestines; spinal cord; stomach: glandular region: submaxillary gland; and 
lung. 

The murine TRL gene, which is approximately 3331 nucleotides in length. ^- 
encodes a protein having a molecular weight of approximately 63 kDa (excluding post- ' ? 

15 translational modifications) and which is approximately 573 amino acid residues in 
length. The murine TRL protein contains at least three cysteine-rich domains. A TRL 
cysteine-rich domain can be found at least, for example, from about amino acids 8-49 of 
SEQ ID NO:2 (Ser8 to Cys49 of the murine TRL amino acid sequence); from about 
amino acids 50-87 of SEQ ID NO:2 (Ile50 to Lys87 of the murine TRL amino acid 

20 sequence): from about amino acids 88-130 of SEQ ID NO:2 (Gln88 to Glyl30 of the 
murine TRL amino acid sequence). The murine TRL protein is a membrane-bound 
protein which contains a transmembrane domain at about amino acids 270-288 of SEQ 
ID NO:2 (Trp270 to Ile288 of the murine TRL amino acid sequence). The C-terminal 
unique region of murine TRL contains a death domain at amino acids 333-416 of SEQ 

25 ID NO:2. 

An alignment of the human TRL I and murine TRL amino acid sequences to 
other members of the TNFR superfamily of proteins is presented in Figure 3. The figure 
depicts an alignment of the amino acid sequences of human TRL (corresponding to 
amino acids I to 273 of SEQ ID NO:4), murine TRL (corresponding to amino acids 1 to 

30 573 of SEQ ID NO:2, human TNFR2 precursor (Swiss-Prot™ Accession No. P20333), 
murine TNFR2 precursor (Swiss-Prot™ Accession No. P251 19), soluble TNFR2 
precursor (Swiss-Prot™ Accession No. P25943), human CD40 receptor precursor 
(Swiss-Prot™ Accession No. P25942), murine CD40 receptor precursor (Swiss-Prot™ 
Accession No. P27512), human osteoprotegerin (Swiss-Prot™ Accession No. U94332), 

35 and murine osteoprotegerin (Swiss-Prot™ Accession No. U94331). A signal sequence is 
designated by bold characters. Cysteine-rich domains are designated alternatively by 
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italicized characters or by underlined, italicized characters. A transmembrane domain is 
indicated by bold, underlined characters. 

An alignment of the human TRL L human TRL II, and murine TRL amino acid 
sequences is presented in Figure 5. A signal sequence is designated by bold characters. 
5 Cysteine-rich domains are designated alternatively by italicized characters or by 
underlined, italicized characters. A transmembrane domain is indicated by bold, 
underlined characters. 

Various aspects of the invention are described in further detail in the following 
subsections: 

10 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 
encode TRL proteins or biologically active portions thereof, as well as nucleic acid 
fragments sufficient for use as hybridization probes to identify TRL-encoding nucleic 

15 acids (e.g., TRL mRNA) and fragments for use as PCR primers for the amplification or 
mutation of TRL nucleic acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g.. cDNA or genomic DNA) and 
RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using 
nucleotide analogs. The nucleic acid molecule can be single-stranded or double- 

20 stranded, but preferably is double-stranded DNA. 

The term "isolated nucleic acid molecule" includes nucleic acid molecules which 
are separated from other nucleic acid molecules which are present in the natural source 
of the nucleic acid. For example, with regards to genomic DNA. the term "isolated" 
includes nucleic acid molecules which are separated from the chromosome with which 

25 the genomic DNA is naturally associated. Preferably, an "isolated" nucleic acid is free 
of sequences which naturally flank the nucleic acid (i.e.. sequences located at the 5' and 
3' ends. of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated TRL nucleic acid 
molecule can contain less than about 5 kb. 4kb, 3kb. 2kb, 1 kb, 0.5 kb or 0.1 kb of 

30 nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA 
of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 
molecule, such as a cDNA molecule, can be substantially free of other cellular material, 
or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

35 A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 

having the nucleotide sequence of SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:22, the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98649, or a 
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portion of these nucleotide molecules, can be isolated using standard molecular biology 
techniques and the sequence information provided herein. Using all or portion of the 
nucleic acid sequences of SEQ ID NO: 1. SEQ ID NO:3. SEQ ID NO:22, or the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98649 as a 
hybridization probe. TRL nucleic acid molecules can be isolated using standard 
hybridization and cloning techniques (e.g.. as described in Sambrook, J., Fritsh, E. F., 
and Maniatis. T. Molecular Cloning: A Laboratory Manual. 2nd, ed. Cold Spring 
Harbor Laboratory. Cold Spring Harbor Laboratory Press. Cold Spring Harbor, NY, 
1989). 

Moreover, a nucleic acid molecule encompassing all or a portion of SEQ ID 
NO:l. SEQ ID NO:3, SEQ ID NO:22, or the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98649 can be isolated by the polymerase chain reaction 
using synthetic oligonucleotide primers designed based upon the sequence of SEQ ID 
NO: 1 . SEQ ID NO:3. SEQ ID NO:22. or the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98649. For example, a portion of SEQ ID NO:3 was 
isolated using oligonucleotide primers T75 pwzf and T75 pwzr. based upon the sequence 
of SEQ ID NO:3 as described in detail in EXAMPLE 5. 

A nucleic acid of the invention can be amplified using cDNA. mRNA or 
alternatively, genomic DNA. as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to TRL nucleotide sequences can be : 
prepared by standard synthetic techniques, e.g.. using an automated DNA synthesizer. 

In a preferred embodiment, an isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence shown in SEQ ID NO: 1 . The sequence of SEQ ID 
NO: 1 corresponds to the murine TRL cDN A. This cDNA comprises sequences 
encoding the murine TRL protein (i.e.. "the coding region", from nucleotides 344 to 
2065), as well as 5' untranslated sequences (nucleotides 1 to 343) and 3' untranslated 
sequences (nucleotides 2066 to 333 1). Alternatively, the nucleic acid molecule can 
comprise only the coding region of SEQ ID NO: 1 (e.g., nucleotides 344 to 2065). 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises the nucleotide sequence shown in SEQ ID NO:3. The sequence of 
SEQ ID NO:3 corresponds to the human TRL I cDNA. This cDNA comprises 
sequences encoding the human TRL I protein (i.e., "the coding region", from nucleotides 
190 to 951). as well as 5' untranslated sequences (nucleotides 1 to 189) and 3' 
untranslated sequences (nucleotides 951 to 2612). Alternatively, the nucleic acid 
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molecule can comprise only the coding region of SEQ ID NO:3 (e.g., nucleotides 190 to 
951). 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises the nucleotide sequence shown in SEQ ID NO:22. The sequence of 
5 SEQ ID NO:22 corresponds to the human TRL II cDNA. This cDNA comprises 
sequences encoding the human TRL II protein (i.e.. "the coding region", from 
nucleotides 510-2324), as well as 5 ! untranslated sequences (nucleotides 1 to 509) and 3' 
untranslated sequences (nucleotides 2325 to 2638). Alternatively, the nucleic acid 
molecule can comprise only the coding region of SEQ ID NO:22 (e.g., nucleotides 510 
10 to 2324). 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
sequence shown in SEQ ID NO: 1 . SEQ ID NO:3. SEQ ID NO:5. SEQ ID NO:6. the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98649, or a 

15 portion of either of these nucleotide sequences. A nucleic acid molecule which is 

complementary to the nucleotide sequence shown in SEQ ID NO:l ? SEQ ID NO: 3. SEQ 
ID NO:5. SEQ ID NO:6, or the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98649 is one which is sufficiently complementary to the nucleotide 
sequence shown in SEQ ID NO: L SEQ ID NO:3. SEQ ID NO:5. SEQ ID NO:6. the 

20 DNA insert of the plasmid deposited with ATCC as Accession Number 98649, such that 
it can hybridize to the nucleotide sequence shown in SEQ ID NO:l. SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:6, the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98649. thereby forming a stable duplex. 

In still another preferred embodiment, an isolated nucleic acid molecule of the 

25 present invention comprises a nucleotide sequence which is at least about 60-65%, 

preferably at least about 70-75%. more preferable at least about 80-85%. and even more 
preferably at least about 90-95% or more homologous to the nucleotide sequence shown 
in SEQ ID NO:l, the nucleotide sequence shown in SEQ ID NO:3, the nucleotide 
sequence shown in SEQ ID NO:5 5 the nucleotide sequence shown in SEQ ID NO:6, the 

30 nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98649, or a portion of any of these nucleotide sequences. 

Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the nucleic acid sequence of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:22, or the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98649, (or SEQ 

35 ID NO:5, SEQ ID NO:6, or SEQ ID NO:24) for example a fragment which can be used 
as a probe or primer or a fragment encoding a biologically active portion of TRL. The 
nucleotide sequence determined from the cloning of the murine and human TRL genes 
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allows for the generation of probes and primers designed for use in identifying and/or* 
cloning TRL homologues in other cell types, e.g. from other tissues, as well as TRL 
homologues from other mammals. The probe/primer typically comprises substantially 
purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide 
5 sequence that hybridizes under stringent conditions to at least about 12. preferably about 
25, more preferably about 40, 50 or 75 consecutive nucleotides of SEQ ID NO:l. SEQ 
ID NO:3 ? SEQ ID NO:22. or the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98649 (or SEQ ID NO:5. SEQ ID NO:6 ; or SEQ ID NO:24) sense, of 
an anti-sense sequence of SEQ ID NO: 1. SEQ ID NO:3, SEQ ID NO:22. or the DNA 

1 0 insert of the plasmid deposited with ATCC as Accession Number 98649. or of a 

naturally occurring mutant of either SEQ ID NO:L SEQ ID NO:3. SEQ ID NO:22. or 
the DNA insert of the plasmid deposited with ATCC as Accession Number 98649 (or 
SEQ ID NO:5. SEQ ID NO:6. or SEQ ID NO:24). 

Probes based on the either the murine or human TRL nucleotide sequences can 

1 5 be used to detect transcripts or genomic sequences encoding the same or homologous 
proteins. In preferred embodiments, the probe further comprises a label group attached 
thereto, e.g. the label group can be a radioisotope, a fluorescent compound, an enzyme, 
or an enzyme co- factor. Such probes can be used as a part of a diagnostic test kit for 
identifying cells or tissue which misexpress a TRL protein, such as by measuring a level 

20 of a TRL-encoding nucleic acid in a sample of cells from a subject e.g.. detecting TRL 
mRNA levels or determining whether a genomic TRL gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically active portion of TRL" can be 
prepared by isolating a portion of SEQ ID NO: L SEQ ID NO:3, SEQ ID NO:5. or SEQ 
ID NO:6, which encodes a polypeptide having a TRL biological activity (the biological 

25 activities of the TRL proteins have previously been described), expressing the encoded 
portion of TRL protein (e.g., by recombinant expression in vitro) and assessing the 
activity of the encoded portion of TRL. For example, a nucleic acid fragment encoding a 
biologically active portion of murine TRL encompasses at least nucleic acids 365-490. 
491-604, or 605-783 of SEQ ID NO:l (encoding a murine TRL CRD). Moreover, a 

30 nucleic acid fragment encoding a biologically active portion of murine TRL 

encompasses at least nucleic acids 1 151-1207 of SEQ ID NO:l (encoding a TRL 
transmembrane domain). Alternatively, a nucleic acid fragment encoding a biologically 
active portion of human TRL encompasses at least nucleic acids 328-456, 457-582, 583- 
696, or 697-825 of SEQ ID NO:3 (encoding a human TRL CRD). Moreover, a nucleic 

35 acid fragment encoding a biologically active portion of human TRL encompasses at least 
nucleic acids 190-312 of SEQ ID NO:3 (encoding a TRL signal sequence). 
Alternatively, a nucleic acid fragment encoding a biologically active portion of human 
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TRL II encompasses at least nucleic acids 648-776. 777-902, 903-1016, or 1017-1 145 of 
SEQ ID NO:22 (encoding a human TRL II CRD). Moreover, a nucleic acid fragment 
encoding a biologically active portion of human TRL encompasses at least nucleic acids 
5 10-632 of SEQ ID NO:3 (encoding a TRL signal sequence). 

The invention further encompasses nucleic acid molecules which are degenerate 
sequence variants of the nucleic acid molecules having the nucleotide sequence set forth 
as SEQ ID NO:L SEQ ID NO:3 SEQ ID NO:22. or the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98649. As used herein, a "degenerate 
sequence variant" is a nucleic acid molecule having a sequence that differs from the 
nucleotide sequence shown in SEQ ID NO: 1 . SEQ ID NO:3, SEQ ID NO:22. or the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98649 (and 
portions thereof, e.g.. SEQ ID NO:5, or SEQ ID NO:6) due to degeneracy of the genetic 
code but encodes the same TRL protein as that encoded by the nucleotide sequence 
shown in SEQ ID NO: L SEQ ID NO:3, SEQ ID NO:22. or the DNA insert of the 
plasmid deposited with ATCC as Accession Number 98649. In another embodiment, an 
isolated nucleic acid molecule of the invention has a nucleotide sequence encoding a 
protein having an amino acid sequence shown in SEQ ID NO:2. SEQ ID NO:4, or SEQ 
ID NO:23. 

In addition to the murine and human TRL nucleotide sequences shown in SEQ 
ID NO:L SEQ ID NO:3 ? and SEQ ID NO:22. it will be appreciated by those skilled in 
the art that DNA sequence polymorphisms that lead to changes in the amino acid 
sequences of TRL may exist within a population (e.g.. the human population). Such 
genetic polymorphism in the TRL gene may exist among individuals within a population 
due to natural allelic variation. As used herein, the terms "gene" and "recombinant gene" 
refer to nucleic acid molecules which include an open reading frame encoding a TRL 
protein, preferably a mammalian TRL protein, and can further include non-coding 
regulatory sequences, and introns. 

Allelic variants of human TRL include both functional and non-functional TRL 
proteins. Functional allelic variants are naturally occurring amino acid sequence variants 
of the human TRL protein that maintain a TRL biological activity, as described 
previously. Functional allelic variants will typically contain only conservative 
substitution of one or more amino acids of SEQ ID NO:2, SEQ ID NO: 4, or SEQ ID 
NO:23, or substitution, deletion or insertion of non-critical residues in non-critical 
regions of the proteins. 

Non-functional allelic variants are naturally occurring amino acid sequence 
variants of the human TRL protein that do not have one or more biological activities of a 
TRL protein. Non-functional allelic variants will typically contain a non-conservative 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT7US98/20219 



substitution, a deletion, or insertion or premature truncation of the amino acid sequence 
of SEQ ID NO:2, SEQ ID NO: 4. or SEQ ID NO:23. or a substitution, insertion or 
deletion in critical residues or critical regions. 

The present invention further provides non-human orthologues of the human and 
5 mouse TRL proteins of the present invention. Orthologues of the human and mouse 
TRL proteins of the present invention are proteins that are isolated from other organisms 
and and posess at least one of the biological activities of the mouse or human TRL 
protein. Orthologues can readily be identified as including an amino acid sequence that 
is substantially homologous to SEQ ID NO:2, SEQ ID NO:4. or SEQ ID NO:23, as set 
10 forth herein. 

Moreover, nucleic acid molecules encoding other TRL family members, and thus 
which have a nucleotide sequence which differs from the murine and human sequences 
of SEQ ID NO:K SEQ ID NO:3. or SEQ ID NO:22. are intended to be within the scope 
of the invention. 

15 Nucleic acid molecules corresponding to allelic variants, homoiogues. and/or 

orthologues of the TRL cDNAs of the invention can be isolated based on their homology 
to the murine or human TRL nucleic acids disclosed herein using the murine or human 
cDNAs. or a portion of either sequence, as a hybridization probe according to standard 
hybridization techniques under stringent hybridization conditions. For example, a 

20 soluble murine TRL cDNA can be isolated based on its homology to murine membrane- 
bound or human soluble TRL. Likewise, a membrane-bound human TRL cDNA can be 
isolated based on its homology to soluble human TRL or murine membrane bound TRL. 
Nucleic acid molecules corresponding to allelic variants, homoiogues. and/or orthologies 
of the TRL cDNAs of the invention can further be isolated by mapping to the same • 

25 chromosome or locus as the TRL gene. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 15 nucleotides in length and hybridizes under stringent conditions to 
the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1 . In other 
embodiment, the nucleic acid is at least 30. 50. 100. 250 or 500 nucleotides in length. 

30 As used herein, the term "hybridizes under stringent conditions" is intended to describe 
conditions for hybridization and washing under which nucleotide sequences at least 60% 
homologous to each other typically remain hybridized to each other. Preferably, the 
conditions are such that sequences at least about 65%. more preferably at least about 
70%. and even more preferably at least about 75% homologous to each other typically 

35 remain hybridized to each other. Such stringent conditions are known to those skilled in 
the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989). 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization 
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conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2 X SSC, 0. 1% SDS at 50-65°C. Preferably, an 
isolated nucleic acid molecule of the invention that hybridizes under stringent conditions 
to the sequence of SEQ ID NO: 1 , SEQ ID NO:3. or SEQ IN NO:22 corresponds to a 
naturally-occurring nucleic acid molecule. As used herein, a "naturally-occurring" 
nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence 
that occurs in nature (e.g., encodes a natural protein). 

In addition to naturally-occurring allelic variants of the TRL sequence that may 
exist in the population, the skilled artisan will further appreciate that changes can be 
introduced by mutation into the nucleotide sequence of SEQ ID NO:l. SEQ ID NO:3. or 
SEQ ID NO:2 thereby leading to changes in the amino acid sequence of the encoded 
TRL protein, without altering the functional ability of the TRL protein. For example, 
nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid 
residues can be made in the sequence of SEQ ID NO: 1 . SEQ ID NO:3. SEQ ID NO:22, 
or the DNA insert of the plasmid deposited with ATCC as Accession Number 98649. A 
"non-essential" amino acid residue is a residue that can be altered from the wild-type 
sequence of TRL (e.g., the sequence of SEQ ID NO:2. SEQ ID NO:4. or SEQ ID 
NO:23) without altering the biological activity, whereas an "essential" amino acid 
residue is required for biological activity. For example, amino acid residues of TRL that 
are conserved among the murine and human family members of this invention (as 
indicated by the alignment and comparison of the amino acid sequences of SEQ ID 
NOs:2 and 4 presented as Figure 3 or by the alignment and comparison of the amino acid 
sequences of SEQ ID NO:s 2. 4. and 23 presented as Figure 5) are predicted to be 
essential in TRL and thus are not likely to be amenable to alteration. Furthermore, 
amino acid residues that are conserved among the TRL proteins of the present invention, 
as well as among the TNFR2, OPG, and CD40 protein families (as indicated by the 
alignment presented as Figure 3) are predicted to be particularly unamenable to alteration 
(For example, all proteins of the TNFR2, OPG, and CD40 families, as well as the TRL 
proteins of the present invention, contain at least four cysteine residues among the CRR 
domains (the CRD domains for murine and human TRL have been previously described, 
the CRD domains of the TNFR2, OPG. and CD40 families are as follows: amino acids 
39-76, 77-118, 119-162, and 163-201 of huTNFR2; amino acids 39-77, 78-119, 120- 
164, 165-203 of muTNFR2; amino acids 27-62, 63-104, 105-147, and 148-186 of 
soluble TNFR2; amino acids 25-60, 61-103, 104-144, and 145-187 of huCD40R; amino 
acids 25-60, 61-103. 104-144, and 145-187 of muCD40R; and amino acids 22-64, 65- 
105, 106-142. and 143-194 of both huOPG and muOPG. 
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In addition, the amino acid sequence of human TRL I has two predicted N- 
glycosylation sites corresponding to amino acids 82-84 and 141-143 of SEQ ID NO:4 
and the amino acid sequence of human TRL II amino acid sequence has 6 predicted N- 
glycosylation sites corresponding to amino acids 82-84, 141-143, 252-254, 257-259 r 
5 278-280. and 289-291. The human TRL II amino acid sequence further has three 
putative protein kinase C phosphorylation sites corresponding to amino acids Ser44L 
Thr467, and Thr506 and the murine TRL amino acid sequence has four putative protein 
kinase C phosphorylation sites corresponding to amino acids Ser29K Thr 294, Thr 385 ; 
and Thr 424 which are not likely to be amenable to alteration. Furthermore, the death 

10 domains of human TRL II and murine TRL have conserved amino acid residues as 
indicated in Figure 6 which are not likely to be amenable to alteration. 

Moreover, structure/function and crystallographic analyses of various members 
of the TNFR superfamily have identified residues and/or regions that are important for 
the activity of these proteins. Thus, these highly conserved regions in TNFR 

1 5 superfamily proteins are not likely to be amenable to mutation. Other amino acid 
residues, however, (e.g.. those that are not conserved or only semi-conserved among 
members of the TNFR superfamily) may not be essential for activity and thus are likely 
to be amenable to alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 

20 encoding TRL proteins that contain changes in amino acid residues that are not essential 
for activity. Such TRL proteins differ in amino acid sequence from SEQ ID NO:2/SEQ 
ID NO:4. or SEQ ID NO:23 yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein/ 
wherein the protein comprises an amino acid sequence at least about 60% homologous to 

25 the amino acid sequence of SEQ ID NO:2. SEQ ID NO:4, or SEQ ID NO:23. 

Preferably, the protein encoded by the nucleic acid molecule is at least about 70% 
homologous to SEQ ID NO:2. SEQ ID NO:4, or SEQ ID NO:23, more preferably at 
least about 80% homologous to SEQ ID NO:2 r SEQ ID NO:4, or SEQ ID NO:23 ? even 
more preferably at least about 90% homologous to SEQ ID NO:2 ? SEQ ID NO:4, or 

30 SEQ ID NO:23, and most preferably at least about 95% homologous to SEQ ID NO:2, 
SEQ ID NO:4. or SEQ ID NO:23. 

An isolated nucleic acid molecule encoding a TRL protein homologous to the 
protein of SEQ ID NO:2. SEQ ID NO:4 ? or SEQ ID NO:23 can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the 

35 nucleotide sequence of SEQ ID NO: 1 s SEQ ID NO:3, SEQ ID NO:22 9 or the DNA insert 
of the plasmid deposited with ATCC as Accession Number 98649 such that one or more 
amino acid substitutions, additions or deletions are introduced into the encoded protein. 
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Mutations can be introduced into SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:22. or the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98649 by 
standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are made at one or more predicted non- 
5 essential amino acid residues. A "conservative amino acid substitution" is one in which 
the amino acid residue is replaced with an amino acid residue having a similar side 
chain. Families of amino acid residues having similar side chains have been defined in 
the an. These families include amino acids with basic side chains (e.g., lysine, arginine, 
histidine). acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side 

10 chains (e.g., glycine, asparagine, glutarnine, serine, threonine, tyrosine, cysteine), 

nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, 
methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) 
and aromatic side chains (e.g.. tyrosine, phenylalanine, tryptophan, histidine). Thus, a 
predicted nonessential amino acid residue in TRL is preferably replaced with another 

15 amino acid residue from the same side chain family. Alternatively, in another 

embodiment, mutations can be introduced randomly along all or part of a TRL coding 
sequence, such as by saturation mutagenesis, and the resultant mutants can be screened 
for TRL biological activity activity to identify mutants that retain activity. Following 
mutagenesis of SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:5. or SEQ ID NO:6. the 

20 encoded protein can be expressed recombinantly and the activity of the protein can be 
determined. 

In a preferred embodiment, a mutant TRL protein can be assayed for (1) the 
ability to form proteimprotein interactions with other TNFR superfamily proteins, other 
cell-surface proteins, or biologically active portions thereof; (2) complex formation 

25 between a mutant TRL protein and a TRL ligand; (3) the ability of a mutant TRL protein 
to bind to an intracellular target protein or biologically active portion thereof; (e.g. SH2 
domain-containing proteins or cytoskeletal proteins). In yet another preferred 
embodiment, a mutant TRL can be assayed for the ability to (1 ) modulate cellular signal 
transduction; (2) regulate cellular proliferation; (3) regulate cellular differentiation; (4) 

30 regulate cell survival or apoptosis; (5) modulate a cell involved in the immune response; 
and (6) modulate a cell involved in a metabolic response (e.g., a pancreatic islet cell). 

In addition to the nucleic acid molecules encoding TRL proteins described above, 
another aspect of the invention pertains to isolated nucleic acid molecules which are 
antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence which is 

35 complementary to a "sense" nucleic acid encoding a protein, e.g., complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can hydrogen bond to a sense nucleic 
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acid. The antisense nucleic acid can be complementary to an entire TRJL coding strand, 
or to only a portion thereof. In one embodiment, an antisense nucleic acid molecule is 
antisense to a "coding region" of the coding strand of a nucleotide sequence encoding 
TRL. The term "coding region" refers to the region of the nucleotide sequence 
5 comprising codons which are translated into amino acid residues (e.g., the coding region 
of murine TRL corresponds to SEQ ID NO:5 and the coding region of human TRL 
corresponds to SEQ ID NO:6). In another embodiment, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide 
sequence encoding TRL. The term "noncoding region" refers to 5' and 3' sequences 

10 which flank the coding region that are not translated into amino acids (i.e., also referred 
to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding TRL disclosed herein (e.g.. SEQ ID 
NO:5. SEQ ID NO:6, or SEQ ID NO:24). antisense nucleic acids of the invention can be 
designed according to the rules of Watson and Crick base pairing. The antisense nucleic 

1 5 acid molecule can be complementary to the entire coding region of TRL mRNA. but 
more preferably is an oligonucleotide which is antisense to only a portion of the coding 
or noncoding region of TRL mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation start site of TRL mRNA. An 
antisense oligonucleotide can be. for example, about 5. 10, 15. 20, 25. 30, 35, 40. 45 or 

20 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed 
using chemical synthesis and enzymatic ligation reactions using procedures known in the 
art. For example, an antisense nucleic acid (e.g.. an antisense oligonucleotide) can be 
chemically synthesized using naturally occurring nucleotides or variously modified 1 "* 
nucleotides designed to increase the biological stability of the molecules or to increase 

25 the physical stability of the duplex formed between the antisense and sense nucleic acids, 
e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used. 
Examples of modified nucleotides which can be used to generate the antisense nucleic 
acid include 5 -fluoro uracil, 5-bromouracil, 5-chlorouraciL 5-iodouracil, hypoxanthine, 
xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 

30 carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluraciL 
dihydrouracil, beta-D-galactosylqueosine, inosine. N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine, 2.2-dimethylguanine. 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 

35 mannosylqueosine, 5'-methoxycarboxymethyluracil. 5-methoxyuracil. 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v). wybutoxosine, pseudouracil, queosine. 
2-thiocytosine. 5-methyl-2-thiouraciL 2-thiouracil. 4-thiouracil, 5-methyluraciL uracil-5- 
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oxyacetic acid methylester, uracil-5-oxyacetic acid (v). 5-methyI-2-thiouraciL 3-(3- 
amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2.6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into 
which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA 
^ 5 transcribed from the inserted nucleic acid will be of an antisense orientation to a target 
nucleic acid of interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a TRL protein to thereby inhibit expression of the 

10 protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or. for example, in the 
case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. An example of a route of 
administration of antisense nucleic acid molecules of the invention include direct 

15 injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 
to target selected cells and then administered systemically. For example, for systemic 
administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface, e.g.. by linking the antisense 
nucleic acid molecules to peptides or antibodies which bind to cell surface receptors or 

20 antigens. The antisense nucleic acid molecules can also be delivered to cells using the 
vectors described herein. To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the antisense nucleic acid molecule is 
placed under the control of a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 

25 is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual P-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids. 
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o- 
methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a 

30 chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA. to which they have 
a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in 

35 Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
TRL mRNA transcripts to thereby inhibit translation of TRL mRNA. A ribozyme 
having specificity for a TRL-encoding nucleic acid can be designed based upon the 
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nucleotide sequence of a TRL cDNA disclosed herein (i.e., SEQ ID NO:l). For 
example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the 
nucleotide sequence of the active site is complementary to the nucleotide sequence to be 
cleaved in a TRL-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071 ; and 
Cech et al. U.S. Patent No. 5,1 16,742. Alternatively, TRL mRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. 
See, e.g., Bartel. D. and Szostak, J.W. (1993) Science 261:1411-1418. 

Alternatively, TRL gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the TRL (e.g.,. the TRL promoter 
and/or enhancers) to form triple helical structures that prevent transcription of the TRL 
gene in target cells. See generally, Helene. C. ( 1991 ) Anticancer Drug Des. 6(6):569-84; 
Helene. C. et al. (1992) Ann. N. Y. Acad Sci. 660:27-36; and Maher, L.J. (1992) 
Bioassays 14(12):807-15. 

In preferred embodiments, the nucleic acids of TRL can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 
Hyrup B. et al (1996) Bioorganic & Medicinal Chemistry 4(1): 5-23). As used herein, 
the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA 
mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are retained. The neutral backbone of 
PNAs has been shown to allow for specific hybridization to DNA and RNA under -■ 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed 
using standard solid phase peptide synthesis protocols as described in Hyrup B. et al 
(1996) supra: Perry-O'Keefe et al. PNAS 93: 14670-675. 

PNAs of TRL can be used therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing trascription or translation arrest or inhibiting 
replication. PNAs of TRL can also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PCR clamping; as 'artificial restriction 
enzymes when used in combination with other enzymes, e.g., SI nucleases (Hyrup B. 
(1996) supra): or as probes or primers for DNA sequence and hybridization (Hyrup B. et 
al (1996) supra: Perry-O'Keefe supra). 

In another embodiment, PNAs of TRL can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras of TRL can be generated 
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which may combine the advantageous properties of PNA and DNA. Such chimeras 
allow DNA recognition enzymes, e.g., RNAse H and DNA polymerases, to interact with 
the DNA portion while the PNA portion would provide high binding affinity and 
specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths 
5 selected in terms of base stacking, number of bonds between the nucleobases, and 
orientation (Hyrup B. (1996) supra). The synthesis of PNA-DNA chimeras can be 
performed as described in Hyrup B. (1996) supra and Finn P.J, et al (1996) Nucleic 
Acids Research 24 (17): 3357-63. For example, a DNA chain can be synthesized on a 
solid support using standard phosphoramidite coupling chemistry and modified 

10 nucleoside analogs, e.g., 5 , -(4-methoxytrityl)amino-5 , -deoxy-thymidine 

phosphoramidite, can be used as a between the PNA and the 5' end of DNA (Mag. M et 
al. (1989) Nucleic Acid Res. 17: 5973-88). PNA monomers are then coupled in a 
stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA 
segment (Finn P. J. et al. (1996) supra). Alternatively, chimeric moleclues can be 

15 synthesized with a 5' DNA segment and a 3' PNA segment (Petersen K.H. et al. (1975) 
Bioorganic Med. Chem. Lett. 5: 1119-11 124). 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g.. Letsinger et al., 1989. Proc. Natl. Acad. 

20 Sci. U.S.A. 86:6553-6556; Lemaitre et al.. 1987. Proc. Natl. Acad. Sci. 84:648-652; PCT 
Publication No. W088/09810. published December 15. 1988) or the blood-brain barrier 
(see, e.g., PCT Publication No. W089/10134, published April 25. 1988). In addition, 
oligonucleotides can be modified with hybridization-triggered cleavage agents (See. e.g.. 
Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents. (See. e.g., Zon. 

25 1988. Pharm. Res. 5:539-549). To this end. the oligonucleotide may be conjugated to 
another molecule, e.g., a peptide, hybridization triggered cross-linking agent, transport 
agent, hybridization-triggered cleavage agent, etc. 

IL Isolated TRL Proteins and Anti-TRL Antibodies 

30 One aspect of the invention pertains to isolated TRL proteins,, and biologically 

active portions thereof, as well as polypeptide fragments suitable for use as immunogens 
to raise anti-TRL antibodies. In one embodiment, native TRL proteins can be isolated 
from cells or tissue sources by an appropriate purification scheme using standard protein 
purification techniques. In another embodiment, TRL proteins are produced by 

35 recombinant DNA techniques. Alternative to recombinant expression, a TRL protein or 
polypeptide can be synthesized chemically using standard peptide synthesis techniques. 
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An "isolated" or "purified" protein or biologically active portion thereof is 
substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the TRL protein is derived, or substantially free from chemical 
precursors or other chemicals when chemically synthesized. The language "substantially 
5 free of cellular material" includes preparations of TRL protein in which the protein is 
separated from cellular components of the ceils from which it is isolated or 
recombinantly produced. In one embodiment, the language "substantially free of cellular 
material" includes preparations of TRL protein having less than about 30% (by dry 
weight) of non-TRL protein (also referred to herein as a "contaminating protein"), more 

10 preferably less than about 20% of non-TRL protein, still more preferably less than about 
10% of non-TRL protein, and most preferably less than about 5% non-TRL protein. 
When the TRL protein or biologically active portion thereof is recombinantly produced, 
it is also preferably substantially free of culture medium, i.e.. culture medium represents 
less than about 20%, more preferably less than about 10%, and most preferably less than 

1 5 about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of TRL protein in which the protein is separated from chemical 
precusors or other chemicals which are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other chemicals" 

20 includes preparations of TRL protein having less than about 30% (by dry weight) of 
chemical precursors or non-TRL chemicals, more preferably less than about 20% 
chemical precursors or non-TRL chemicals, still more preferably less than about 10% 
chemical precursors or non-TRL chemicals, and most preferably less than about 5%' 
chemical precursors or non-TRL chemicals. 

25 Biologically active portions of a TRL protein include peptides comprising amino 

acid sequences sufficiently homologous to or derived from the amino acid sequence of 
the TRL protein, e.g., the amino acid sequence shown in SEQ ID NO:2. SEQ ID NO:4, 
or SEQ ID NO:23, which include less amino acids than the full length TRL proteins, and 
exhibit at least one activity of a TRL protein. Typically, biologically active portions 

30 comprise a domain or motif with at least one activity of the TRL protein. A biologically 
active portion of a TRL protein can be a polypeptide which is, for example, 10, 25, 50, 
100 or more amino acids in length. 

In one embodiment, a biologically active portion of a TRL protein comprises at 
least one CRD characteristic of the TNFR superfamily of proteins. In another 

35 embodiment, a biologically active portion of a TRL protein comprises at least a 

transmembrane domain. In yet another embodiment, a biologically active portion of a 
TRL protein comprises at least a signal sequence. 
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In an alternative embodiment, a biologically active portion of a TRL protein 
comprises at least a C-terminal unique domain of a TRL protein. In another 
embodiment, a biologically active portion of a TRL protein comprises at least a C- 
terminal unique domain which contains a death domain. In another embodiment, a 
biologically active portion of a TRL protein comprises at least a death domain. In yet 
another embodiment, a biologically active portion of a TRL protein comprises the N- 
terminal portion of a TRL molecule containing at least one. preferably two, more 
preferably three and even more preferably four CRDs, but is missing a substantial 
portion of the TRL C-terminal unique domain. Such a preferred TRL molecule is 
referred to as a "TRL extracellular domain". For example, preferred TRL extracellular 
domains contain at least about amino acids 1-290. 1-270. 50-270, 88-270, or 131-270 of 
SEQ ID NO:2. at least about amino acids 1-212. 48-2 1 2. 90-2 12, 1 3 1 -2 1 2 of SEQ ID 
NO:4. or at least about 1-370. 1-351.47-370. 90-370. 132-370. or 170-370. 

It is to be understood that a preferred biologically active portion of a TRL 
protein of the present invention may contain at least one of the above-identified 
structural domains. A more preferred biologically active portion of a TRL protein may 
contain at least two of the above-identified structural domains. An even more preferred 
biologically active portion of a TRL protein may contain at least three of the above- 
identified structural domains. A particularly preferred biologically active portion of a 
TRL protein of the present invention may contain at least four of the above-identified 
structural domains. A more particularly preferred biologically active portion of a TRL 
protein may have at least five of the above-identified structural domains. Finally, a most 
preferred biologically active portion of a TRL protein may contain at least six of the 
above-identified structural domains. 

Moreover, other biologically active portions, in which other regions of the 
protein are deleted, can be prepared by recombinant techniques and evaluated for one or 
more of the functional activities of a native TRL protein. 

In a preferred embodiment, the TRL protein has an amino acid sequence shown 
in SEQ ID NO:2. SEQ ID NO:4, or SEQ ID NO:23. In other embodiments, the TRL 
protein is substantially homologous to SEQ ID NO:2. SEQ ID NO:4, or SEQ ID NO:23 
and retains the functional' activity of the protein of SEQ ID NO:2, SEQ ID NO:4, or SEQ 
ID NO:23 yet differs in amino acid sequence due to natural allelic variation or 
mutagenesis, as described in detail in subsection II below. Accordingly, in another 
embodiment, the TRL protein is a protein which comprises an amino acid sequence at 
least about 60% homologous to the amino acid sequence of SEQ ID NO:2, SEQ ID 
NO:4. or SEQ ID NO:23 and retains the functional activity of the TRL proteins of SEQ 
ID NO:2. SEQ ID NO:4. or SEQ ID NO:23. Preferably, the protein is at least about 70% 
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homologous to SEQ ID NO:2, SEQ ID NO:4. or SEQ ID NO:23, more preferably at 
least about 80% homologous to SEQ ID NO:2. SEQ ID NO:4. or SEQ ID NO:23, even 
more preferably at least about 90% homologous to SEQ ID NO 2 or SEQ ID NO:4, and 
most preferably at least about 95% or more homologous to SEQ ID NO:2, SEQ ID 
5 NO:4,orSEQIDNO:23. 

To determine the percent homology of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence and non-homologous sequences 

10 can be disregarded for comparison purposes). In a preferred embodiment, the length of a 
reference sequence aligned for comparison purposes is at least 30%, preferably at least 
40%. more preferably at least 50%. even more preferably at least 60%, and even more 
preferably at least 70% r 80%. or 90% of the length of the reference sequence (e.g., when 
aligning a second sequence to the TRL amino acid sequence of SEQ ID NO:2 having 

1 5 • 573 amino acid residues, at least 1 72, preferably at least 229, more preferably at least 

287. even more preferably at least 344. and even more preferably at least 401. 458 or 5 16 
amino acid residues are aligned). The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as 

20 the corresponding position in the second sequence, then the molecules are homologous at 
that position (i.e.. as used herein amino acid or nucleic acid "homology 11 is equivalent to 
amino acid or nucleic acid "identity"). The percent homology between the two 
sequences is a function of the number of identical positions shared by the sequences (i.e., 
% homology = # of identical positions/total # of positions x 100). 

25 The comparison of sequences and determination of percent homology between 

two sequences can be accomplished using a mathematical algorithim. A preferred, non- 
limiting example of a mathematical algorithim utilized for the comparison of sequences 
is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-68, 
modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such 

30 an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) of 
Altschul. et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to TRL nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST program, score = 50, 

35 wordlength = 3 to obtain amino acid sequences homologous to TRL protein molecules of 
the invention. To obtain gapped alignments for comparison purposes. Gapped BLAST 
can be utilized as described in Altschul et al., (1997) Nucleic Acids Research 
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25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default 
parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http://w-ww.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a 
mathematical algorithim utilized for the comparison of sequences is the algorithm of 
Myers and Miller. CABIOS (1989). Such an algorithm is incorporated into the ALIGN 
program ( version 2.0) which is part of the GCG sequence alignment software package. 
When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 
weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

Another preferred, non-limiting example of a mathematical algorithim utilized 
for the alignment of protein sequences is the Lipman-Pearson algorithm (Lipman and 
Pearson (1985) Science 227:1435-1441). When using the Lipman-Pearson algorithm, a 
PAM250 weight residue table, a gap length penaltv of 12. a gap penalty of 4, and a 
Ktuple of 2 can be used. A preferred, non-limiting example of a mathematical 
algorithim utilized for the alignment of nucleic acid sequences is the Wilbur-Lipman 
algorithm (Wilbur and Lipman (1983) Proc. Natl. Acad Sci. USA 80:726-730). When 
using the Wilbur-Lipman algorithm, a window of 20. gap penalty of 3. Ktuple of 3 can 
be used. Both the Lipman-Pearson algorithm and the Wilbur-Lipman algorithm are 
incorporated, for example, into the MEGALIGN program (e.g.. version 3.1.7) which is 
part of the DNASTAR sequence analysis software package. 

Additional algorithims for sequence analysis are known in the art, and include 
ADVANCE and ADAM, described in Torelli and Robotti (1994) Comput. AppL Biosci. 
10:3-5: and FASTA, described in Pearson and Lipman (1988) P.N.A.S. 85:2444-8. 

In another preferred embodiment, the percent homology between two amino acid 
sequences can be accomplished using the GAP program in the GCG software package 
(available at http://www.gcg.com). using either a Blossom 62 matrix or a PAM250 
matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet 
another preferred embodiment, the percent homology between two nucleic acid 
sequences can be accomplished using the GAP program in the GCG software package 
(available at http://www.gcg.com), using a gap weight of 50 and a length weight of 3. 

The invention also provides TRJL chimeric or fusion proteins. As used herein, a 
TRL "chimeric protein" or "fusion protein" comprises a TRL polypeptide operatively 
linked to a non-TRL polypeptide. A "TRL polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to TRL, whereas a "non-TRL polypeptide" refers 
to a polypeptide having an amino acid sequence corresponding to a protein which is not 
substantially homologous to the TRL protein, e.g.. a protein which is different from the 
TRL protein and which is derived from the same or a different organism. Within a TRL 
fusion protein the TRL polypeptide can correspond to all or a portion of a TRL protein. 
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In a preferred embodiment, a TRJL fusion protein comprises at least one biologically 
active portion of a TRL protein. In another preferred embodiment, a TRL fusion protein 
comprises at least two biologically active portions of a TRL protein. In another 
preferred embodiment, a TRL fusion protein comprises at least three biologically active 
portions of a TRL protein. Within the fusion protein, the term "operatively linked" is 
intended to indicate that the TRL polypeptide and the non-TRL polypeptide are fused in- 
frame to each other. The non-TRL polypeptide can be fused to the N-terminus or C- 
terminus of the TRL polypeptide. 

For example, in one embodiment a TRL fusion protein comprises the 
extracellular domain of a TRL protein operably linked to the intercellular portion of a 
second protein known to be involved in cellular signaling. In another embodiment, a 
TRL fusion protein comprises a TRL C-terminal unique domain opreably linked to the 
extracellular domain of a second protein known to be involved in cellular signaling. 
Such fusion proteins can be further utilized in screening assays for compounds which 
modulate TRL activity (such assays are described in detail below). 

In yet another embodiment, the fusion protein is a GST-TRL fusion protein in 
which the TRL sequences are fused to the C-terminus of the GST sequences. Such 
fusion proteins can facilitate the purification of recombinant TRL. 

In another embodiment, the fusion protein is a TRL protein containing a 
heterologous signal sequence at its N-terminus. For example, the native TRL signal 
sequence (i.e, about amino acids 1 to 41 or amino acids 1-42 of SEQ ID NO:4) can be 
removed and replaced with a signal sequence from another protein. In certain hostcells 
(e.g.. mammalian host cells), expression and/or secretion of TRL can be increased 
through use of a heterologous signal sequence. 

In another embodiment, the fusion protein is a protein containing a TRL signal 
sequence, operatively linked at least to the N-terminal end of a protein which is poorly 
secreted form the intracellular millieu (e.g., the human OB receptor). For example, the 
N-terminal 42 amino acid residues of human TRL (e.g., about amino acid residues 1-42) 
can be linked to the N-terminal end of the mature OB receptor, or a protion thereof, (e.g., 
the extracellular domain). This fusion can further be linked at the C-terminus, for 
example, to human IgG Fc domain. Such a fusion protein is described further in 
Example 8. 

In yet another embodiment, the fusion protein is a TRL-immunoglobulin fusion 
protein in which the TRL sequences comprising primarily the TNFR-like CRD are fused 
to sequences derived from a member of the immunoglobulin protein family. Soluble 
derivatives have also been made of cell surface glycoproteins in the immunoglobulin 
gene superfamily consisting of an extracellular domain of the cell surface glycoprotein 
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fused to an immunoglobulin constant (Fc) region (see e.g., Capon, D.J. et al. (1989) 
Nature 337:525-531 and Capon U.S. Patents 5,1 16,964 and 5.428.130 [CD4-IgGl 
constructs]; Linsley, P.S. et al (1991)7. Exp. Med, 173:721-730 [aCD28-IgGl 
construct and a B7-l-IgGl construct]; and Linsley, P.S. et ai (1991)7. Exp. Med 
174:561-569 and U.S. Patent 5,434,131 [a CTLA4-IgGl]). Such fusion proteins have 
proven useful for modulating receptor-ligand interactions. Soluble derivatives of cell 
surface proteins of the TNFR superfamily proteins have been made consisting of an 
extracellular domain of the cell surface receptor fused to an immunoglobulin constant 
(Fc) region (See for example Moreland et al. (1997) N. Engl. J. Med. 337(3): 14 1 - 1 47; 
van der Poll et al. (1997) Blood 89(10):3727-3734; and Ammann et al. (1997) J. Clin. 
Invest. 99(7):1699-1703.) 

The TRL-immunoglobulin fusion proteins of the invention can be incorporated 
into pharmaceutical compositions and administered to a subject to inhibit an interaction 
between a TRL ligand and a TRL protein on the surface of a cell, to thereby suppress 
TRL-mediated signal transduction in vivo. The TRL-immunoglobulin fusion proteins 
can be used to affect the bioavailability of a TRL cognate ligand. Inhibition of the TRL 
ligand/TRL interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g. promoting or 
inhibiting) cell survival. Moreover, the TRL-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-TRL antibodies in a subject, to 
purify TRL ligands and in screening assays to identify molecules which inhibit the 
interaction of TRL with a TRL ligand. 

Preferably, a TRL chimeric or fusion protein of the invention is produced by 
standard recombinant DNA techniques. For example. DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with 
conventional techniques, for example by employing blunt-ended or stagger-ended 
termini for ligation, restriction enzyme digestion to provide for appropriate termini, 
filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 
be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 
fragments which can subsequently be annealed and reamplified to generate a chimeric 
gene sequence (see, for example. Current Protocols in Molecular Biology, eds. Ausubel 
et al. John Wiley & Sons: 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). A TRL- 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT7US98/20219 



- 35 - 

encoding nucleic acid can be cloned into such an expression vector such that the fusion 
moiety is linked in-frame to the TRL protein. 

The present invention also pertains to variants of the TRL proteins which 
function as either TRL agonists (mimetics) or as TRL antagonists. Variants of the TRL 
5 protein can be generated by mutagenesis, e.g., discrete point mutation or truncation of 
the TRL protein. An agonist of the TRL protein can retain substantially the same, or a 
subset, of the biological activities of the naturally occurring form of the TRL protein. 
An antagonist of the TRL protein can inhibit one or more of the activities of the naturally 
occurring form of the TRL protein by, for example, competitively binding to a 

10 downstream or upstream member of a cellular signaling cascade which includes the TRL 
protein. Thus, specific biological effects can be elicited by treatment with a variant of 
limited function. In one embodiment, treatment of a subject with a variant having a 
subset of the biological activities of the naturally occurring form of the protein has fewer 
side effects in a subject relative to treatment with the naturally occurring form of the 

15 TRL proteins. 

In one. variants of the TRL protein which function as either TRL agonists 
(mimetics) or as TRL antagonists can be identified by screening combinatorial libraries 
of mutants, e.g., truncation mutants, of the TRL protein for TRL protein agonist or 
antagonist activity. In one embodiment, a variegated library of TRL variants is 

20 generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a 
variegated gene library. A variegated library of TRL variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic oligonucleotides into gene . 
sequences such that a degenerate set of potential TRL sequences is expressible as ^ 
individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage 

25 display) containing the set of TRL sequences therein. There are a variety of methods 
which can be used to produce libraries of potential TRL variants from a degenerate 
oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
appropriate expression vector. Use of a degenerate set of genes allows for the provision, 

30 in one mixture, of all of the sequences encoding the desired set of potential TRL 

sequences. Methods for synthesizing degenerate oligonucleotides are known in the art 
(see, e.g., Narang, S.A. (1983) Tetrahedron 39:3; Itakura et al. (1984) Anmt. Rev. 
Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid 
Res. 11:477. 

35 In addition, libraries of fragments of the TRL protein coding sequence can be 

used to generate a variegated population of TRL fragments for screening and subsequent 
selection of variants of a TRL protein. In one embodiment, a library of coding sequence 
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fragments can be generated by treating a double stranded PCR fragment of a TRL coding 
sequence with a nuclease under conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA. renaturing the DNA to form double 
stranded DNA which can include sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes by treatment with SI 
nuclease, and ligating the resulting fragment library into an expression vector. By this 
method, an expression library can be derived which encodes N-terminal. C-terminal and 
internal fragments of various sizes of the TRL protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of TRL 
proteins. The most widely used techniques, which are amenable to high through-put 
analysis, for screening large gene libraries typically include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates isolation of the vector encoding the gene whose product was 
detected. Recrusive ensemble mutagenesis (REM), a new technique which enhances the 
frequency of functional mutants in the libraries, can be used in combination with the 
screening assays to identify TRL variants (Arkin and Yourvan (1992) PIVAS 89:781 1- 
7815; Delgrave et al. (1993) Protein Engineering 6(3):327-33 1). 

In one embodiment, cell based assays can be exploited to analyze a variegated 
TRL library-. For example, a library of expression vectors can be transfected into a cell 
line which ordinarily responds to a particular cytokine in a TRL-dependent manner. The 
transfected cells are then contacted with the cytokine and the effect of expression of the 
mutant on signaling by the cytokine can be detected, e.g. by measuring NF-kB activity 
or cell survival. Plasmid DNA can then be recovered from the cells which score for 
inhibition, or alternatively, potentiation of cytokine induction, and the individual clones 
further characterized. 

An isolated TRL protein, or a portion or fragment thereof, can be used as an 
immunogen to generate antibodies that bind TRL using standard techniques for 
polyclonal and monoclonal antibody preparation. The full-length TRL protein can be 
used or. alternatively, the invention provides antigenic peptide fragments of TRL for use 
as immunogens. The antigenic peptide of TRL comprises at least 8 amino acid residues 
of the amino acid sequence shown in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:23 
and encompasses an epitope of TRL such that an antibody raised against the peptide 
forms a specific immune complex with TRL. Preferably, the antigenic peptide 
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comprises at least 10 amino acid residues, more preferably at least 15 amino acid 
residues, even more preferably at least 20 amino acid residues, and most preferably at 
least 30 amino acid residues. 

Preferred epitopes encompassed by the antigenic peptide are regions of TRL that 
5 are located on the surface of the protein, e.g., hydrophilic regions. A hydrophobicity 
analysis of the murine TRL protein sequence indicates 3 hydrophilic regions that are 
preferred for use as antigenic peptides: amino acid residues 10-35, amino acid residues 
70-95, and amino acid residues 190-220 of SEQ ID NO: 2. A hydrophobicity analysis of 
the human TRL I protein sequence indicates 3 hydrophilic regions that are preferred for 

10 use as antigenic peptides: amino acid residues 91-112, amino acid residues 51-75, and 
amino acid residues 224-250 of SEQ ID NO: 4. 

A TRL immunogen typically is used to prepare antibodies by immunizing a 
suitable subject, (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An 
appropriate immunogenic preparation can contain, for example, recombinantly expressed 

1 5 TRL protein or a chemically synthesized TRL polypeptide. The preparation can further 
include an adjuvant, such as Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable subject with an immunogenic 
TRL preparation induces a polyclonal anti-TRL antibody response. 

Accordingly, another aspect of the invention pertains to anti-TRL antibodies. 

20 The term "antibody" as used herein refers to immunoglobulin molecules and 

immunologically active portions of immunoglobulin molecules, i.e., molecules that 
contain an antigen binding site which specifically binds (immunoreacts with) an antigen, 
such as TRL. Examples of immunologically active portions of immunoglobulin " 
molecules include F(ab) and F(ab')2 fragments which can be generated by treating the 

25 antibody with an enzyme such as pepsin. The invention provides polyclonal and 

monoclonal antibodies that bind TRL. The term "monoclonal antibody" or "monoclonal 
antibody composition", as used herein, refers to a population of antibody molecules that 
contain only one species of an antigen binding site capable of immunoreacting with a 
particular epitope of TRL. A monoclonal antibody composition thus typically displays a 

30 single binding affinity for a particular TRL protein with which it immunoreacts. 

Polyclonal anti-TRL antibodies can be prepared as described above by 
immunizing a suitable subject with a TRL immunogen. The anti-TRL antibody titer in 
the immunized subject can be monitored over time by standard techniques, such as with 
an enzyme linked immunosorbent assay (ELISA) using immobilized TRL. If desired, 

35 the antibody molecules directed against TRL can be isolated from the mammal (e.g., 
from the blood) and further purified by well known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an appropriate time after immunization. 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9915663A1> 



WO 99/15663 



PCT/US98/20219 



-38- 

e.g., when the anti-TRL antibody titers are highest, antibody-producing cells can be 
obtained from the subject and used to prepare monoclonal antibodies by standard 
techniques, such as the hybridoma technique originally described by Kohler and Milstein 
(1975) Nature 256:495-497) (see also, Brown et al. (1981) J. Immunol 127:539-46: 
5 Brown etal. (1980)7. Biol. Chem .255:4980-83; Yeh et al. (1976) PNAS 76:2927-31 ; 
and Yeh et al. (1982) Int. J. Cancer 29:269-75), the more recent human B cell 
hybridoma technique (Kozbor et al. (1983) Immunol Today 4:72), the EBV-hybridoma 
technique (Cole et ai. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
Inc., pp. 77-96) or trioma techniques. The technology for producing monoclonal 

10 antibody hybridomas is well known (see generally R. H. Kenneth, in Monoclonal 

Antibodies: A New Dimension In Biological Analyses. Plenum Publishing Corp., New- 
York, New York (1980); E. A. Lerner(1981) Yale J. Biol Med. 54:387-402; M. L. 
Gefter et al. (1977) Somatic Cell Genet. 3:23 1-36). Briefly, an immortal cell line 
(typically a myeloma) is fused to lymphocytes (typically splenocytes) from a mammal 

1 5 immunized with a TRL immunogen as described above, and the culture supernatants of 
the resulting hybridoma cells are screened to identify a hybridoma producing a 
monoclonal antibody that binds TRL. 

Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating an anti-TRL 

20 monoclonal antibody (see, e.g.. G. Galfre et al. (1977) Nature 266:55052; Gefter et al. 
Somatic Cell Genet., cited supra: Lerner, Yale J. Biol. Med., cited supra; Kenneth, 
Monoclonal Antibodies, cited supra). Moreover, the ordinarily skilled worker will 
appreciate that there are many variations of such methods which also would be useful. 
Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the same 

25 mammalian species as the lymphocytes. For example, murine hybridomas can be made 
by fusing lymphocytes from a mouse immunized with an immunogenic preparation of 
the present invention with an immortalized mouse cell line. Preferred immortal cell lines 
are mouse myeloma cell lines that are sensitive to culture medium containing 
hypoxanthine. aminopterin and thymidine ("HAT medium"). Any of a number of 

30 myeloma cell lines can be used as a fusion partner according to standard techniques, e.g., 
the P3-NSl/l-Ag4-l, P3-x63-Ag8.653 or Sp2/0-Agl4 myeloma lines. These myeloma 
lines are available from ATCC. Typically, HAT-sensitive mouse myeloma cells are 
fused to mouse splenocytes using polyethylene glycol ("PEG"). Hybridoma cells 
resulting from the fusion are then selected using HAT medium, which kills unfused and 

35 unproductive ly fused myeloma cells (unfused splenocytes die after several days because 
they are not transformed). Hybridoma cells producing a monoclonal antibody of the 
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invention are detected by screening the hybridoma culture supernatants for antibodies 
that bind TRL, e.g., using a standard ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal anti-TRL antibody can be identified and isolated by screening a recombinant 
combinatorial immunoglobulin library (e.g., an antibody phage display library) with 
TRL to thereby isolate immunoglobulin library members that bind TRL. Kits for 
generating and screening phage display libraries are commercially available (e.g.. the 
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Sur/ZAP™ Phage Display Kit, Catalog No. 240612). Additionally, examples 
of methods and reagents particularly amenable for use in generating and screening 
antibody display library can be found in, for example, Ladner et al. U.S. Patent No. 
5,223,409: Kang et al. PCT International Publication No. WO 92/18619; Dower et al. 
PCT International Publication No. WO 91/17271; Winter et al. PCT International 
Publication WO 92/20791 ; Markland et al. PCT International Publication No. WO 
92/15679; Breitling et al. PCT International Publication WO 93/01288; McCafferty et al. 
PCT International Publication No. WO 92/01047; Garrard et al. PCT International 
Publication No. WO 92/09690; Ladner et al. PCT International Publication No. WO 
90/02809; Fuchs et al. ( 1 99 1 ) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246: 1275-1281 ; Griffiths et al. 
(1993) EMBO J 12:725-734: Hawkins et al. (1992) J. Mol Biol 226:889-896: Clarkson 
et al. (1991) Nature 352:624-628; Gram et al. (1992) PNAS 89:3576-3580; Garrad et al. 
(1991) Bio/Technology 9:1373-1377: Hoogenboom et al. (1991) Nuc. Acid Res. 19:4133- 
4137; Barbas et al. (1991 ) PNAS 88:7978-7982; and McCafferty et al. Nature (1990) ' 
348:552-554. 

Additionally, recombinant anti-TRL antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 
made using standard recombinant DNA techniques, are within the scope of the 
invention. Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art, for example using methods described in 
Robinson et al. International Application No. PCT/US86/02269; Akira, et al. European 
Patent Application 184/187; Taniguchi, M. ? European Patent Application 171,496; 
Morrison et al. European Patent Application 173.494; Neuberger et al. PCT International 
Publication No. WO 86/01533; Cabilly et al. U.S. Patent No. 4,816,567; Cabilly et al. 
European Patent Application 125.023; Better et al. (1988) Science 240:1041-1043; Liu et 
al. (1987) /W/fS 84:3439-3443; Liu et al. (1987)7. Immunol 139:3521-3526; Sun et al. 
(1987) PNAS 84:214-21 8; Nishimura et al. (1987) Cane. Res. 47:999-1005; Wood et al. 
(1985) Nature 314:446-449; and Shaw et al. (1988) J. Natl Cancer Inst. 80:1553-1559): 
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Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) BioTechniques 4:214; 
Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321:552-525; Verhoeyan et aL 
(1988) Science 239:1534; and Beidler et al. (1988) J. Immunol 141:4053-4060. 

An anti-TRL antibody (e.g., monoclonal antibody) can be used to isolate TRL by 
standard techniques, such as affinity chromatography or immunoprecipitation. An anti- 
TRL antibody can facilitate the purification of natural TRL from cells and of 
recombinantly produced TRL expressed in host cells. Moreover, an anti-TRL antibody 
can be used to detect TRL protein (e.g., in a cellular lysate or cell supernatant) in order to 
evaluate the abundance and pattern of expression of the TRL protein. Anti-TRL 
antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to. for example, determine the efficacy of a given 
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) the 
antibody to a detectable substance. Examples of detectable substances include various 
enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include horseradish 
peroxidase, alkaline phosphatase, (3-galactosidase, or acetylcholinesterase; examples of 
suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin: 
examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 
isothiocyanate. rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
bioluminescent materials include luciferase. luciferin, and aequorin. and examples of 
suitable radioactive material include 125 I, 13 1 L 35 S or 3 H. 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding TRL (or a portion thereof). As used herein, the term 
"vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to 
which it has been linked. One type of vector is a "plasmid". which refers to a circular 
double stranded DNA loop into which additional DNA segments can be ligated. 
Another type of vector is a viral vector, wherein additional DNA segments can be ligated 
into the viral genome. Certain vectors are capable of autonomous replication in a host 
cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction into 
the host cell, and thereby are replicated along with the host genome. Moreover, certain 
vectors are capable of directing the expression of genes to which they are operatively 
linked. Such vectors are are referred to herein as "expression vectors". In general, 
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expression vectors of utility in recombinant DNA techniques are often in the form of 
plasmids. In the present specification, "plasmid" and "vector" can be used 
interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression vectors, such as viral 
5 vectors (e.g.. replication defective retroviruses, adenoviruses and adeno-associated 
viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 

10 sequences, selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g.. in an in vitro transcription/translation system or in a 

1 5 host cell when the vector is introduced into the host cell). The term "regulatory 

sequence" is intended to includes promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press. San Diego. CA (1990). Regulatory sequences include those which 

20 direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g.. 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, etc. The 

25 expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., TRL proteins, mutant forms of TRL, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of TRL in prokaryotic or eukaryotic cells. For example. TRL can be 

30 expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression 
vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in 
Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, 
San Diego, CA (1990). Alternatively, the recombinant expression vector can be 
transcribed and translated in v/Yro, for example using T7 promoter regulatory sequences 

35 and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promotors directing the expression of either 
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fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
5 of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 

10 Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. 
and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 
In a preferred embodiment, exemplified in Example 5. the coding sequence of 

15 human TRL (i.e., encompassing amino acids 1 to 253) is cloned into a pCD5 expression 
vector to create a vector encoding a TRL-Ig fusion protein. In an alternative preferred 
embodiment, also exemplified in Example 5. the coding sequence of a form of human 
TRL lacking the signal sequence (i.e., encompassing amino acids 42 to 253) is cloned 
into a pPicZ expression vector (InVitrogen) downstream and in frame with a yeast- 

20 derived signal sequence. In yet another preferred embodiment, also exemplified in 
Example 5, the coding sequence of human TRL (i.e.. encompassing amino acids 1 to 
253) is cloned into a retroviral expression vector. pWZLBlastEC. The fusion proteins 
can be purified utilizing methods well known in the art of protein purification. Purified 
fusion proteins can be utilized in TRL activity assays, in TRL ligand binding (e.g. direct 

25 assays or competitive assays described in detail below), to generate antibodies specific 
for TRL proteins, as examples. In a preferred embodiment, a TRL fusion expressed in a 
retroviral expression vector of the present invention can be utilized to infect bone 
marrow cells which are subsequently transplanted into irradiated recipients. The 
pathology of the subject recipient is then examined after sufficient time has passed (e.g 

30 six (6) weeks). 

Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amann et ah, (1988) Gene 69:301-315) and pET 1 Id (Studier et al., Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, 
California (1990) 60-89). Target gene expression from the pTrc vector relies on host 

35 RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
expression from the pET 1 Id vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 

SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 9915663A1> 



WO 99/15663 



PCT/US98/20219 



-43 - 

polymerase is supplied by host strains BL21(DE3) or HMS1 74(DE3) from a resident X 
prophage harboring a T7 gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
5 the protein in a host bacteria with an impaired capacity to proteolytically cleave the 

recombinant protein (Gottesman, S.. Gene Expression Technology: Methods in 

Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Another 

strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 

expression vector so that the individual codons for each amino acid are those 
10 preferentially utilized in E. coli (Wada et aL, (1992) Nucleic Acids Res. 20:21 1 1-2118). 

Such alteration of nucleic acid sequences of the invention can be carried out by standard 

DNA synthesis techniques. 

In another embodiment, the TRL expression vector is a yeast expression vector. 

Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari. et 
15 aL, (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), 

pJRY88 (Schultz et aL, (1987) Gene 54:1 13-123), pYES2 (Invitrogen Corporation, San 

Diego, CA), and picZ (InVitrogen Corp. San Diego, CA). 

Alternatively, TRL can be expressed in insect cells using baculovirus expression 

vectors. Baculovirus vectors available for expression of proteins in cultured insect cells 
20 (e.g.. Sf 9 cells) include the pAc series (Smith et al. (1983) Mol Cell Biol 3:2156-2165) 

and the pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 

mammalian cells using a mammalian expression vector. Examples of mammalian" 1 

expression vectors include pCDM8 (Seed. B. (1987) Nature 329:840) and pMT2PC 
25 (Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the 

expression vector's control functions are often provided by viral regulatory elements. 

For example, commonly used promoters are derived from polyoma. Adenovirus 2, 

cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 

prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsh, E. F., 
30 and Maniatis. T. Molecular Cloning: A Laboratory Manual 2nd, ed. f Cold Spring 

Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 

1989. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
35 (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et aL 
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(1987) Genes Dev. 1:268-277). lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
5 (e.g., the neurofilament promoter; Byrne and Ruddle (1989) PNAS 86:5473-5477), 

pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4.873316 and 
European Application Publication No. 264,166). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Gruss (1990) 

10 Science 249:374-379) and the a-fetoprotein promoter (Campes and Tilghman (1989) 
Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 

15 a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to TRL mRNA. Regulatory sequences operatively 
linked to a nucleic acid cloned in the antisense orientation can be chosen which direct the 
continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen which 

20 direct constitutive, tissue specific or cell type specific expression of antisense RNA. The 
antisense expression vector can be in the form of a recombinant plasmid, phagemid or 
attenuated virus in which antisense nucleic acids are produced under the control of a high 
efficiency regulatory region, the activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the regulation of gene expression 

25 using antisense genes see Weintraub. H. et al., Antisense RNA as a molecular tool for 
genetic analysis. Reviews - Trends in Genetics. Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a TRL nucleic 
acid molecule of the invention is introduced, e.g., a TRL nucleic acid molecule within a 
recombinant expression vector or a TRL nucleic acid molecule in a form suitable for 

30 homologous recombination in the genome of a host cell (e.g., a TRL nucleic acid 

molecule which includes TRL nucleotide sequences and additional 5' and 3' flanking 
sequences necessary for homologous recombination). The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
refer not only to the particular subject cell but to the progeny or potential progeny of 

35 such a cell. Because certain modifications may occur in succeeding generations due to 
either mutation or environmental influences, such progeny may not, in fact, be identical 
to the parent cell, but are still included within the scope of the term as used herein. 
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A host cell can be any prokaryotic or eukaryotic cell. For example. TRL protein 
can be expressed in bacterial cells such as E. coli^ insect cells, yeast or mammalian cells 
(such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are 
known to those skilled in the art. 
5 Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g.. DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation. DEAE-dextran-mediated 

10 transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning; A 
Laboratory Manual. 2nd, ed.. Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press. Cold Spring Harbor. NY. 1989) ? and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 

15 expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 

20 hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 

introduced into a host cell on the same vector as that encoding TRL or can be introduced 
on a separate vector. Cells stably transfected with the introduced nucleic acid can be 
identified by drug selection (e.g.. cells that have incorporated the selectable markeFgene 
will survive, while the other cells die). 

25 A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 

culture, can be used to produce (i.e., express) TRL protein. Accordingly, the invention 
further provides methods for producing TRL protein using the host cells of the invention. 
In one embodiment, the method comprises culturing the host cell of invention (into 
which a recombinant expression vector encoding TRL has been introduced) in a suitable 

30 medium such that TRL protein is produced. In another embodiment, the method further 
comprises isolating TRL from the medium or the host cell. 
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The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which TRL-coding sequences have been 
introduced. Such host cells can then be used to create non-human transgenic animals in 
5 which exogenous TRL sequences have been introduced into their genome or 

homologous recombinant animals in which endogenous TRL sequences have been 
altered. Such animals are useful for studying the function and/or activity of TRL and for 
identifying and/or evaluating modulators of TRL activity. As used herein, a "transgenic 
animal" is a non-human animal, preferably a mammal, more preferably a rodent such as 

10 a rat or mouse, in which one or more of the cells of the animal includes a transgene. 
Other examples of transgenic animals include non-human primates, sheep, dogs, cows, 
goats, chickens, amphibians, etc. A transgene is exogenous DNA which is integrated 
into the genome of a cell from which a transgenic animal develops and which remains in 
the genome of the mature animal, thereby directing the expression of an encoded gene 

1 5 product in one or more cell types or tissues of the transgenic animal. As used herein, a 
"homologous recombinant animal" is a non-human animal, preferably a mammal, more 
preferably a mouse, in which an endogenous TRL gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule 
introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to 

20 development of the animal. 

A transgenic animal of the invention can be created by introducing TRL^ 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The human TRL cDNA sequence of SEQ ID 

25 NO:l, SEQ ID NO:22, or the DNA insert of the plasmid deposited with ATCC as 

Accession Number 98649 can be introduced as a transgene into the genome of a non- 
human animal. Alternatively, a nonhuman homologue of the human TRL gene, such as 
the mouse TRL gene can be used as a transgene. Intronic sequences and polyadenylation 
signals can also be included in the transgene to increase the efficiency of expression of 

30 the transgene. A tissue-specific regulatory sequence(s) can be operably linked to the 
TRL transgene to direct expression of TRL protein to particular cells. Methods for 
generating transgenic animals via embryo manipulation and microinjection, particularly 
animals such as mice, have become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et ah, U.S. Patent 

35 No. 4,873,191 by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, 
(Cold Spring Harbor Laboratory Press. Cold Spring Harbor, N.Y., 1986). Similar 
methods are used for production of other transgenic animals. A transgenic founder 
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animal can be identified based upon the presence of the TRL transgene in its genome 
and/or expression of TRL mRNA in tissues or cells of the animals. A transgenic founder 
animal can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene encoding TRL can further be bred to other 
5 transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of a TRL gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g., functionally disrupt, the TRL gene. The TRL gene can 
be a human gene (e.g., the cDNA of SEQ ID NO:3 or SEQ ID NO:22), but more 

10 preferably, is a non-human homologue of a human TRL gene. For example, a mouse 
TRL gene of SEQ ID NO:l can be used to construct a homologous recombinant nucleic 
acid molecule, e.g., a vector, suitable for altering an endogenous TRL gene in the mouse 
genome. In a preferred embodiment, the homologous recombinant nucleic acid molecule 
is designed such that, upon homologous recombination, the endogenous TRL gene is 

1 5 functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a 
"knock out" vector). Alternatively, the homologous recombinant nucleic acid molecule 
can be designed such that, upon homologous recombination, the endogenous TRL gene 
is mutated or otherwise altered but still encodes functional protein (e.g., the upstream 
regulatory region can be altered to thereby alter the expression of the endogenous TRL 

20 protein). In the homologous recombination vector, the altered portion of the TRL gene 
is flanked at its 5' and 3' ends by additional nucleic acid of the TRL gene to allow for 
homologous recombination to occur between the exogenous TRL gene carried by the 
vector and an endogenous TRL gene in a cell, e.g., an embryonic stem cell. The 
additional flanking TRL nucleic acid is of sufficient length for successful homologous 

25 recombination with the endogenous gene. Typically, several kilobases of flanking DNA 
(both at the 5' and 3' ends) are included in the homologous recombinant nucleic acid 
molecule (see e.g., Thomas, K.R. and Capecchi, M. R. (1987) Cell 51 :503 for a 
description of homologous recombination vectors). The homologous recombinant 
nucleic acid molecule is introduced into a cell, e.g., an embryonic stem cell line (e.g., by 

30 electroporation) and cells in which the introduced TRL gene has homologously 

recombined with the endogenous TRL gene are selected (see e.g., Li, E. et al. (1992) 
Cell 69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomas and 
Embryonic Srem Cells: A Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 1987) 

35 pp. 1 13-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
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which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 
nucleic acid molecules, e.g., vectors, or homologous recombinant animals are described 
further in Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT 
5 International Publication Nos.: WO 90/1 1354 by Le Mouellec et aL; WO 91/01 140 by 
Smithies et aL; WO 92/0968 by Zijlstra et ah; and WO 93/04169 by Berns et al. 

In another embodiment, transgenic non-humans animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PL For a 

10 description of the cre/loxP recombinase system, see. e.g., Lakso et al. (1992) PNAS 
89:6232-6236, Another example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251:1351-1355. If 
a cre/loxP recombinase system is used to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase and a selected protein are 

1 5 required. Such animals can be provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be 
produced according to the methods described in Wilmut, I. et al. (1997) Nature 385:810- 

20 813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 
of the same species from which the quiescent cell is isolated. The recontructed oocyte is 
then cultured such that it develops to morula or blastocyte and then transferred to 

25 pseudopregnant female foster animal. The offspring borne of this female foster animal 
will be a clone of the animal from which the cell, e.g., the somatic cell, is isolated. 

IV. Pharmaceutical Compositions 

The TRL nucleic acid molecules, TRL proteins, and anti-TRL antibodies (also 
30 referred to herein as "active compounds") of the invention can be incorporated into 

pharmaceutical compositions suitable for administration. Such compositions typically 
comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 
acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is 
intended to include any and all solvents, dispersion media, coatings, antibacterial and 
35 antifungal agents, isotonic and absorption delaying agents, and the like, compatible with 
pharmaceutical administration. The use of such media and agents for pharmaceutically 
active substances is well known in the art. Except insofar as any conventional media or 
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agent is incompatible with the active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 
5 with its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 

1 0 polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite: chelating agents such as ethylenediaminetetraacetic 
acid: buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases; 

1 5 such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 

20 extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous 
administration, suitable carriers include physiological saline, bacteriostatic water, 
Cremophor EL™ (BASF. Parsippany, NJ) or phosphate buffered saline (PBS). In all 
cases, the composition must be sterile and should be fluid to the extent that easy ^ 
syringability exists. It must be stable under the conditions of manufacture and storage 

25 and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 

30 maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 

35 chloride in the composition. Prolonged absorption of the injectable compositions can be 
brought about by including in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 
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Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g.. a TRL protein or anti-TRL antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by incorporating 
5 the active compound into a sterile vehicle which contains a basic dispersion medium and 
the required other ingredients from those enumerated above. In the case of sterile 
powders for the preparation of sterile injectable solutions, the preferred methods of 
preparation are vacuum drying and freeze-drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 

1 0 solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 

1 5 using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically compatible 
binding agents, and/or adjuvant materials can be included as part of the composition. 
The tablets, pills, capsules, troches and the like can contain any of the following 
ingredients, or compounds of a similar nature: a binder such as microcrystalline 

20 cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 

disintegrating agent such as alginic acid. Primogel, or corn starch; a lubricant such as 
magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl 
salicylate, or orange flavoring. 

25 For administration by inhalation, the compounds are delivered in the form of an 

aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 

30 permeated are used in the formulation. Such penetrants are generally known in the an, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 

35 the art. 
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The compounds can also be prepared in the form of suppositories (e.g.. with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment; the active compounds are prepared with carriers that will 
5 protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 

10 The materials can also be obtained commercially from Alza Corporation and Nova 

Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. Patent No. 4,522,81 1. 

15 It is especially advantageous to formulate oral or parenteral compositions in 

dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 

20 pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 5 

The nucleic acid molecules of the invention can be inserted into vectors and used 

25 as gene therapy vectors. Gene therapy vectors can be delivered to a subject by. for 
example, intravenous injection, local administration (see U.S. Patent 5.328,470) or by 
stereotactic injection (see e.g., Chen et al. (1994) PNAS 91 :3054-3057). The 
pharmaceutical preparation of the gene therapy vector can include the gene therapy 
vector in an acceptable diluent, or can comprise a slow release matrix in which the gene 

30 delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. retroviral vectors, the pharmaceutical 
preparation can include one or more cells which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

35 



BNSDOCID: <WO 9915663A1> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT/US98/20219 



- 52 - 

V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues. and antibodies 
described herein can be used in one or more of the following methods: a) screening 
assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenetics): and c) methods of treatment (e.g., therapeutic and 
prophylactic). As described herein, a TRL protein of the invention has the following 
activities: (i) formation of a homogeneous multimeric signaling complex with like TRL 
proteins; (ii) formation of a heterogeneous multimeric signaling complex with other 
TNFR superfamily proteins or other cell-surface proteins; (iii) complex formation 
between a membrane-bound TRL protein and a cytokine; (iv) complex formation 
between a soluble TRL protein and a cognate ligand; (v) interaction of a TRL protein 
with an intracellular protein having substantial homology to the TNFR-associated 
proteins: (vi) interaction of a TRL protein with a TNFR-associated protein; and (vii) 
interaction of a TRL protein with other cellular proteins including cytoplasmic proteins 
(e.g. SH2 domain-containing proteins or a second death domain-containing protein) or 
cytoskeletal proteins and can thus be used to (i) modulate cellular signal transduction: 
(ii) regulate cellular proliferation; (iii) regulate cellular differentiation; (iv) regulate cell 
survival or apoptosis; (v) modulate a cell involved in the immune response; or (vi) 
modulate a cell involved in a metabloic response (e.g., a pancreatic islet cell), either in 
vitro or in vivo. The isolated nucleic acid molecules of the invention can be used to 
express TRL protein (e.g., via a recombinant expression vector in a host cell in gene 
therapy applications), to detect TRL mRNA (e.g., in a biological sample) or a genetic 
lesion in a TRL gene, and to modulate TRL activity, as described further below. In 
addition, the TRL proteins can be used to screen drugs or compounds which modulate 
the TRL activity as well as to treat disorders characterized by insufficient or excessive 
production of TRL protein or production of TRL protein forms which have decreased or 
abherrent activity compared to TRL wild type protein (e.g. proliferative disorders such as 
cancer, for example, pancreatic cancer, inflammatory diseases such as arthritis, or 
metabolic disorders such as insulin resistance or diabetes). Moreover, soluble forms of 
the TRL protein can be used to bind ligands of membrane-bound TRL and influence 
bioavailability. In addition, the anti-TRL antibodies of the invention can be used to 
detect and isolate TRL proteins and modulate TRL activity. 

A. Screening Assays : 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides. 
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peptidomimetics, small molecules or other drugs) which bind to TRL proteins or have a 
stimulatory or inhibitory effect on, for example, TRL expression or TRL activity. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of a 
5 TRL protein or polypeptide or biologically active portion thereof. The test compounds 
of the present invention can be obtained using any of the numerous approaches in 
combinatorial library methods known in the art. including: biological libraries; spatially 
addressable parallel solid phase or solution phase libraries; synthetic library methods 
requiring deconvolution; the 'one-bead one-compound' library method; and synthetic 

10 library methods using affinity chromatography selection. The biological library 

approach is limited to peptide libraries, while the other four approaches are applicable to 
peptide, non-peptide oligomer or small molecule libraries of compounds (Lam, K.S. 
(1997) Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 

15 art, for example in: DeWitt et al. (1993) Proc. Natl, Acad Sci. U.S.A. 90:6909; Erb et 
al. (1994) Proc. NatL Acad Sci. USA 91:1 1422; Zuckermann et al. (1994). J. Med. 
Chem. 37:2678: Cho et al. (1993) Science 261 : 1303: Carrell et al. (1994) Angew. Chem. 
Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and in 
Gallop et al. (1994) J. Med. Chem. 37:1233. 

20 Libraries of compounds may be presented in solution (e.g.. Houghten (1992) 

Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993)yVa/w/*e 364:555-556), bacteria (Ladner USP 5.223.409). spores (Ladner USP 
'409). plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage 
(Scott and Smith (1990) Science 249:386-390); (Devlin (\990)Science 249:404-406): 

25 (Cwirla et al. (1990) Proc. NatL Acad. Sci. 87:6378-6382); (Felici (1991) J. MoL Biol. 
222:301-310): (Ladner supra.). 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a membrane-bound form of TRL protein, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 

30 compound to bind to a TRL protein determined. The cell, for example, can of 

mammalian origin or a yeast cell. Determining the ability of the test compound to bind 
to the TRL protein can be accomplished, for example, by coupling the test compound 
with a radioisotope or enzymatic label such that binding of the test compound to the 
TRL protein or biologically active portion thereof can be determined by detecting the 

35 labeled compound in a complex. For example, test compounds can be labeled with 125 I, 
35 S. K C. or 3 H. either directly or indirectly, and the radioisotope detected by direct 
counting of radioemmission or by scintillation counting. Alternatively, test compounds 
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can be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase. and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a 
compound (e.g., TRL ligand) to interact with a TRL protein without the labeling of any 
of the interactants. For example, a microphysiometer can be used to detect the 
interaction of a compound with its cognate receptor without the labeling of either the 
compound or the receptor. McConnelL H. M. et ai (1992) Science 257:1906-1912. As 
used herein, a "microphysiometer" (e.g., Cytosensor) is an analytical instrument that 
measures the rate at which a cell acidifies its environment using a light-addressable 
potentiometric sensor (LAPS). Changes in this acidification rate can be used as an 
indicator of the interaction between compound and receptor. 

In a preferred embodiment, the assay comprises contacting a cell which expresses 
a membrane-bound form of TRL protein, or a biologically active portion thereof, on the 
cell surface with a known compound which binds TRL to form an assay mixture, 
contacting the assay mixture with a test compound, and determining the ability of the 
test compound to interact with a TRL protein, wherein determining the ability of the test 
compound to interact with a TRL protein comprises determining the ability of the test 
compound to preferentially bind to TRL or a biologically active portion thereof as 
compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a 
cell expressing a membrane-bound form of TRL protein, or a biologically active portion 
thereof, on the cell surface with a test compound and determining the ability of the test 
compound to modulate (e.g. stimulate or inhibit) the activity of the TRL protein or 
biologically active portion thereof. Determining the ability of the test compound to 
modulate the activity of TRL or a biologically active portion thereof can be 
accomplished, for example, by determining the ability of the TRL protein to bind to or 
interact with a TRL target molecule. As used herein, a "target molecule" is a molecule 
with which a TRL protein binds or interacts in nature, for example, a molecule on the 
surface of a cell which expresses a TRL protein, a molecule on the surface of a second 
cell, a molecule in the extracellular milieu, a molecule associated with the internal 
surface of a cell membrane or a cytoplasmic molecule. A TRL target molecule can be a 
non-TRL molecule or a TRL protein or polypeptide of the present invention. In one 
embodiment, a TRL target molecule is a component of a signal transduction pathway 
which facilitates transduction of an extracellular signal (e.g. a signal generated by 
binding of a compound to a membrane-bound TRL molecule) through the cell membrane 
and into the cell. The target, for example, can be a second intercellular protein which 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT7US98/20219 



has catalytic activity or a protein which facilitates the association of downstream 
signaling molecules with TRL. Alternatively, the target molecule can be a substrate for 
a catalytic activity of the TRL protein. 

Determining the ability of the TRL protein to bind to or interact with a TRL 
5 target molecule can be accomplished by one of the methods described above for 

determining direct binding. In a preferred embodiment, determining the ability of the 
TRL protein to bind to or interact with a TRL target molecule can be accomplished by 
determining the activity of the target molecule. For example, the activity of the target 
molecule can be determined by detecting induction of a cellular second messenger of the 

10 target (i.e. intracellular Ca 2+ . diacylglycerol. IP 3 , etc.). detecting catalytic/enzymatic 
activity of the target an appropriate substrate, detecting the induction of a reporter gene 
(comprising a TRL-responsive regulatory element operatively linked to a nucleic acid 
encoding a detectable marker, e.g. luciferase), or detecting a cellular response, for 
example, cell survival, cellular differentiation, or cell proliferation. 

15 In yet another embodiment, an assay of the present invention is a cell-free assay 

comprising contacting a TRL protein or biologically active portion thereof with a test 
compound and determining the ability of the test compound to bind to the TRL protein 
or biologically active portion thereof. Binding of the test compound to the TRL protein 
can be determined either directly or indirectly as described above. In a preferred 

20 embodiment, the assay comprises contacting the TRL protein or biologically active 
portion thereof with a known compound which binds TRL to form an assay mixture.- 
contacting the assay mixture with a test compound, and determining the ability of the 
test compound to interact with a TRL protein, wherein determining the ability of the* test 
compound to interact with a TRL protein comprises determining the ability of the test 

25 compound to preferentially bind to TRL or biologically active portion thereof as 
compared to the known compound. 

In another embodiment, an assay is a cell-free assay comprising contacting TRL 
protein or biologically active portion thereof with a test compound and determining the 
ability of the test compound to modulate (e.g. stimulate or inhibit) the activity of the 

30 TRL protein or biologically active portion thereof Determining the ability of the test 
compound to modulate the activity of TRL can be accomplished, for example, by 
determining the ability of the TRL protein to bind to a TRL target molecule by one of 
the methods described above for determining direct binding. Determining the ability of 
the TRL protein to bind to a TRL target molecule can also be accomplished using a 

35 technology such as real-time Biomolecular Interaction Analysis (BIA). Sjolander, S. and 
Urbaniczky, C (1991) Anal. Chem. 63:2338-2345 and Szabo etal (1995) Curr. Opirt. 
Struct. Biol. 5:699-705. As used herein. "BIA" is a technology for studying biospecific 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 9915663A1> 



WO 99/15663 



PO7US98/20219 



-56- 

interactions in real time, without labeling any of the interactants (e.g., BIAcore). 
Changes in the optical phenomenon of surface plasmon resonance (SPR) can be used as 
an indication of real-time reactions between biological molecules. 

In an alternative embodiment, determining the ability of the test compound to 
modulate the activity of TRL can be accomplished by determining the ability of the 
TRL protein further modulate a TRL target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be 
determined as previously described. 

In yet another embodiment, the cell-free assay comprises contacting the TRL 
protein or biologically active portion thereof with a known compound which binds TRL 
to form an assay mixture, contacting the assay mixture with a test compound/and 
determining the ability of the test compound to interact with a TRL protein, wherein 
determining the ability of the test compound to interact with a TRL protein comprises 
determining the ability of the TRL protein to preferentially bind to or modulate the 
activity of a TRL target molecule. 

The cell-free assays of the present invention are amenable to use of both the 
soluble form or the membrane-bound form of TRL. In the case of cell-free assays 
comprising the membrane-bound form of TRL, it may be desirable to utilize a 
solubilizing such that the membrane-bound form of TRL is maintained in solution. 
Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside. n-dodecylglucoside, n-dodecylmaltoside. octanoyl-N-methyiglucamide. 
decanoyl-N-methylglucamide. Triton® X-100. Triton® X-l 14. Thesit®, 
Isotridecypply(ethylene glycol ether) n . 3-[(3-cholamidopropyl)dimethyIamminio]-l- 
propane sulfonate (CHAPS), 3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-l - 
propane sulfonate (CHAPSO), or N-dodecyI=N ? N-dimethyl-3-ammonio-l -propane 
sulfonate. 

In more than one embodiment of the above assay methods of the present 
invention, it may be desirable to immobilize either TRL or its target molecule to 
facilitate separation of complexed from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of a test 
compound to TRL, or interaction of TRL with a target molecule in the presence and 
absence of a candidate compound, can be accomplished in any vessel suitable for 
containing the reactants. Examples of such vessels include microtitre plates, test tubes, 
and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which 
adds a domain that allows one or both of the proteins to be bound to a matrix. For 
example, glutathione-S-transferase/ TRL fusion proteins or glutathione-S- 
transferase/target fusion proteins can be adsorbed onto glutathione sepharose beads 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT/US98/20219 



- 57 - 

(Sigma Chemical. St. Louis, MO) or glutathione derivatized microtitre plates, which are 
then combined with the test compound or the test compound and either the non-adsorbed 
target protein or TRL protein, and the mixture incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for salt and pH). Following 
5 incubation, the beads or microtitre plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined either 
directly or indirectly, for example, as described above. Alternatively, the complexes can 
be dissociated from the matrix, and the level of TRL binding or activity determined 
using standard techniques. 

10 Other techniques for immobilizing proteins on matrices can also be used in the 

screening assays of the invention. For example, either TRL or its target molecule can be 
immobilized utilizing conjugation of biotin and streptavidin. Biotinylated TRL or target 
molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques 
well known in the art (e.g., biotinylation kit. Pierce Chemicals. Rockford. IL). and 

15 immobilized in the wells of streptavidin-coated 96 well plates ( Pierce Chemical). 
Alternatively, antibodies reactive with TRL or target molecules but which do not 
interfere with binding of the TRL protein to its target molecule can be derivatized to the 
wells of the plate, and unbound target or TRL trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described 

20 above for the GST-immobilized complexes, include immunodetection of complexes 
using antibodies reactive with the TRL or target molecule, as well as enzyme-linked 
assays which rely on detecting an enzymatic activity associated with the TRL or target 
molecule. 

In another embodiment, modulators of TRL expression are identified in a method 
25 wherein a cell is contacted with a candidate compound and the expression of TRL 

mRNA or protein in the cell is determined. The level of expression of TRL mRNA or 
protein in the presence of the candidate compound is compared to the level of expression 
of TRL mRNA or protein in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of TRL expression based on this 
30 comparison. For example, when expression of TRL mRNA or protein is greater 

(statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of TRL mRNA or protein 
expression. Alternatively, when expression of TRL mRNA or protein is less 
(statistically significantly less) in the presence of the candidate compound than in its 
35 absence, the candidate compound is identified as an inhibitor of TRL mRNA or protein 
expression. The level of TRL mRNA or protein expression in the cells can be 
determined by methods described herein for detecting TRL mRNA or protein. 
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In yet another aspect of the invention, the TRL proteins can be used as "bait 
proteins" in a two-hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et al. (1993) 
Cell 72:223-232; Madura et al. (1993) J. Biol Chem. 268:12046-12054; Bartel et al. 
(1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693-1696; and 
Brent WO94/10300), to identify other proteins, which bind to or interact with TRL 
("TRL-binding proteins" or "TRL-bp") and modulate TRL activity. Such TRL-binding 
proteins are also likely to be involved in the propagation of signals by the TRL proteins 
as, for example, upstream or downstream elements of the TRL pathway. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for 
TRL is fused to a gene encoding the DNA binding domain of a known transcription 
factor (e.g.. GAL-4), In the other construct, a DNA sequence, from a library of DNA 
sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene 
that codes for the activation domain of the known transcription factor. If the "bait" and 
the "prey" proteins are able to interact, in vivo, forming a TRL -dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g.. LacZ) which is 
operably linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the cloned gene which 
encodes the protein which interacts with TRL. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., a TRL modulating agent, an antisense TRL 
nucleic acid molecule, a TRL -specific antibody, or a TRL-binding partner) can be used 
in an animal model to determine the efficacy, toxicity, or side effects of treatment with 
such an agent. Alternatively, an agent identified as described herein can be used in an 
animal model to determine the mechanism of action of such an agent. Furthermore, this 
invention pertains to uses of novel agents identified by the above-described screening 
assays for treatments as described herein. 

B- Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map their 
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respective genes on a chromosome; and, thus, locate gene regions associated with 
genetic disease; (ii) identify an individual from a minute biological sample (tissue 
typing); and (iii) aid in forensic identification of a biological sample. These applications 
are described in the subsections below. 

5 

1. Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the TRL nucleotide 

10 sequences, described herein, can be used to map the location of the TRL genes on a 

chromosome. The mapping of the TRL sequences to chromosomes is an important first 
step in correlating these sequences with genes associated with disease. 

Briefly. TRL genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the TRL nucleotide sequences. Computer analysis 

15 of the TRL sequences can be used to predict primers that do not span more than one* 
exon in the genomic DNA, thus complicating the amplification process. These primers 
can then be used for PCR screening of somatic cell hybrids containing individual human 
chromosomes. Only those hybrids containing the human gene corresponding to the TRL 
sequences will yield an amplified fragment. 

20 Somatic cell hybrids are prepared by fusing somatic cells from different 

mammals (e.g.. human and mouse cells). As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in random order, but retain the, 
mouse chromosomes. By using media in which mouse cells cannot grow, because they 
lack a particular enzyme, but human cells can, the one human chromosome that contains 

25 the gene encoding the needed enzyme, will be retained. By using various media, panels 
of hybrid cell lines can be established. Each cell line in a panel contains either a single 
human chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. (D'Eustachio P. et al. (1983) Science 220:919-924). Somatic cell hybrids 

30 containing only fragments of human chromosomes can also be produced by using human 
chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular sequence to a particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using the TRL nucleotide sequences to 

35 design oligonucleotide primers, sublocalization can be achieved with panels of fragments 
from specific chromosomes. Other mapping strategies which can similarly be used to 
map a 9o, Ip. or 1 v sequence to its chromosome include in situ hybridization (described 
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in Fan, Y. et al. (1990) PNAS, 87:6223-27), pre-screening with labeled flow-sorted 
chromosomes, and pre-selection by hybridization to chromosome specific cDNA 
libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
5 chromosomal spread can further be used to provide a precise chromosomal location in 
one step. Chromosome spreads can be made using cells whose division has been 
blocked in metaphase by a chemical such as colcemid that disrupts the mitotic spindle. 
The chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A 
pattern of light and dark bands develops on each chromosome, so that the chromosomes 

10 can be identified individually. The FISH technique can be used with a DNA sequence as 
short as 500 or 600 bases. However, clones larger than 1 ,000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal intensity 
for simple detection. Preferably 1.000 bases, and more preferably 2.000 bases will 
suffice to get good results at a reasonable amount of time. For a review of this 

1 5 technique, see Verrna et al . Human Chromosomes: A Manual of Basic Techniques 
(Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to 

20 noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the 
chance of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 

25 data. (Such data are found, for example, in V. McKusick. Mendelian Inheritance in 

Man, available on-line through Johns Hopkins University Welch Medical Library)- The 
relationship between a gene and a disease, mapped to the same chromosomal region, can 
then be identified through linkage analysis (co-inheritance of physically adjacent genes), 
described in, for example, Egeland, J. et al (1987) Nature, 325:783-787. 

30 Moreover, differences in the DNA sequences between individuals affected and 

unaffected with a disease' associated with the TRJL gene, can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for 

35 structural alterations in the chromosomes, such as deletions or translocations that are 

visible from chromosome spreads or detectable using PCR based on that DNA sequence. 
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Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

2. Tissue Typing 

5 The TRL sequences of the present invention can also be used to identify 

individuals from minute biological samples. The United States military, for example, is 
considering the use of restriction fragment length polymorphism (RFLP) for 
identification of its personnel. In this technique, an individual's genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 

10 unique bands for identification. This method does not suffer from the current limitations 
of "Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The sequences of the present invention are useful as additional DNA markers 
for RFLP (described in U.S. Patent 5,272,057). 

Furthermore, the sequences of the present invention can be used to provide an 

1 5 alternative technique which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the TRL nucleotide sequences 
described herein can be used to prepare two PCR primers from the 5 f and 3' ends of the 
sequences. These primers can then be used to amplify an individual's DNA and 
subsequently sequence it. 

20 Panels of corresponding DNA sequences from individuals, prepared in this 

manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
present invention can be used to obtain such identification sequences from individuals 
and from tissue. The TRL nucleotide sequences of the invention uniquely represent 

25 portions of the human genome. Allelic variation occurs to some degree in the coding 
regions of these sequences, and to a greater degree in the noncoding regions. It is 
estimated that allelic variation between individual humans occurs with a frequency of 
about once per each 500 bases. Each of the sequences described herein can, to some 
degree, be used as a standard against which DNA from an individual can be compared 

30 for identification purposes. Because greater numbers of polymorphisms occur in the 
noncoding regions, fewer sequences are necessary to differentiate individuals. The 
noncoding sequences of SEQ ID NO:l. can comfortably provide positive individual 
identification with a panel of perhaps 10 to 1,000 primers which each yield a noncoding 
amplified sequence of 100 bases. If predicted coding sequences, such as those in SEQ 

35 ID NO:3 are used, a more appropriate number of primers for positive individual 
identification would be 500-2,000. 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID:<WO 9915663A1> 



WO 99/15663 



PCT/US98/20219 



-62- 

If a panel of reagents from TRL nucleotide sequences described herein is used to 
generate a unique identification database for an individual, those same reagents can later 
be used to identify tissue from that individual. Using the unique identification database, 
positive identification of the individual, living or dead, can be made from extremely 
5 small tissue samples. 

3. Use of Partial TRL Sequences in Forensic Biology 

DNA-based identification techniques can also be used in forensic biology. 
Forensic biology is a scientific field employing genetic typing of biological evidence 

10 found at a crime scene as a means for positively identifying, for example, a perpetrator 
of a crime. To make such an identification. PCR technology can be used to amplify 
DNA sequences taken from very small biological samples such as tissues, e.g., hair or 
skin, or body fluids, e.g., blood, saliva, or semen found at a crime scene. The amplified 
sequence can then be compared to a standard, thereby allowing identification of the 

1 5 origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 

20 individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme 
generated fragments. Sequences targeted to noncoding regions of SEQ ID NO:l, SEQ 
ID NO:3, or SEQ ID NO:22 are particularly appropriate for this use as greater numbers 
of polymorphisms occur in the noncoding regions, making it easier to differentiate 

25 individuals using this technique. Examples of polynucleotide reagents include the TRL 
nucleotide sequences or portions thereof, e.g., fragments derived from the noncoding 
regions of SEQ ID NO: 1 , SEQ ID NO:3, or SEQ ID NO:22, having a length of at least 
20 bases, preferably at least 30 bases. 

The TRL nucleotide sequences described herein can further be used to provide 

30 polynucleotide reagents, e.g., labeled or labelable probes which can be used in, for 
example, an in situ hybridization technique, to identify a specific tissue, e.g., brain 
tissue. This can be very useful in cases where a forensic pathologist is presented with a 
tissue of unknown origin. Panels of such TRL probes can be used to identify tissue by 
species and/or by organ type. 

35 In a similar fashion, these reagents, e.g., TRL primers or probes can be used to 

screen tissue culture for contamination (i.e. screen for the presence of a mixture of 
different types of cells in a culture). 
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C. Predictive Medicine : 

The present invention also pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, and monitoring clinical trails are used for 
5 prognostic (predictive) purposes to thereby treat an individual prophylactically. 
Accordingly, one aspect of the present invention relates to diagnostic assays for 
determining TRL protein and/or nucleic acid expression as well as TRL activity, in the 
context of a biological sample (e.g., blood, serum, cells, tissue) to thereby determine 
whether an individual is afflicted with a disease or disorder, or is at risk of developing a 
10 disorder, associated with aberrant TRL expression or activity. The invention also 

provides for prognostic (or predictive) assays for determining whether an individual is at 
risk of developing a disorder associated with TRL protein, nucleic acid expression or 
activity. For example, mutations in a TRL gene can be assayed in a biological sample. 
Such assays can be used for prognostic or predictive purpose to thereby phophylactically 
1 5 treat an individual prior to the onset of a disorder characterized by or associated with 
TRL protein, nucleic acid expression or activity. 

Another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of TRL in clinical trials. 

These and other agents are described in further detail in the following sections. 

20 

1. Diagnostic Assays : 

An exemplary method for detecting the presence or absence of TRL protein or 
nucleic acid in a biological sample involves obtaining a biological sample from a test 
subject and contacting the biological sample with a compound or an agent capable of 

25 detecting TRL protein or nucleic acid (e.g., mRNA, genomic DNA) that encodes TRL 
protein such that the presence of TRL is detected in the biological sample. A preferred 
agent for detecting TRL mRNA or genomic DNA is a labeled nucleic acid probe capable 
of hybridizing to TRL mRNA or genomic DNA. The nucleic acid probe can be, for 
example, the full-length TRL cDNA of SEQ ID NO: 1 or SEQ ID NO:3, or a portion 

30 thereof, such as an oligonucleotide of at least 15, 30, 50, 100. 250 or 500 nucleotides in 
length and sufficient to specifically hybridize under stringent conditions to TRL mRNA 
or genomic DNA. Other suitable probes for use in the diagnostic assays of the invention 
are described herein. 

A preferred agent for detecting TRL protein is a labeled antibody capable of 

35 binding to TRL protein. Antibodies can be polyclonal, or more preferably, monoclonal. 
An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term 
"labeled", with regard to the probe or antibody, is intended to encompass direct labeling 
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of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody ^ as well as indirect labeling of the probe or antibody by reactivity 
with another reagent that is directly labeled. Examples of indirect labeling include 
detection of a primary antibody using a fluorescently labeled secondary antibody and 
5 end-labeling of a DNA probe with biotin such that it can be detected with fluorescently 
labeled streptavidin. The term "biological sample" is intended to include tissues, cells 
and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. That is ? the detection method of the invention can be used to detect 
TRL mRNA or protein in a biological sample in vitro as well as in vivo. For example, in 

10 vitro techniques for detection of TRL mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of TRL protein include enzyme linked 
immunosorbent assays (ELISAs). Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of TRL genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of TRL protein 

15 include introducing into the subject a labeled anti-TRL antibody. For example, the 
antibody can be labeled with a radioactive marker whose presence and location in a 
subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the 
test subject. Alternatively, the biological sample can contain mRNA molecules from the 

20 test subject or genomic DNA molecules from the test subject. A preferred biological 
sample is a tissue sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control 
biological sample from a control subject, contacting the control sample with a compound 
or agent capable of detecting TRL protein. mRNA. or genomic DNA. such that the 

25 presence of TRL protein, mRNA or genomic DNA is detected in the biological sample, 
and comparing the presence of TRL protein, mRNA or genomic DNA in the control 
sample with the presence of TRL protein. mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of TRL in a 
biological sample. For example, the kit can comprise a labeled compound or agent 

30 capable of detecting TRL protein or mRNA in a biological sample; means for 

determining the amount of TRL in the sample; and means for comparing the amount of 
TRL in the sample with a standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 
TRL mRNA or protein. 
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2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
TRL expression or activity. As used herein, the term "aberrant" includes a TRL 
5 expression or activity which deviates from the wild type TRL expression or activity. 
Aberrant expression or activity includes increased or decreased expression or activity, as 
well as expression or activity which does not follow the wild type developmental pattern 
of expression or the subcellular pattern of expression. For example, aberrant TRL 
expression or activity is intended to include the cases in which a mutation in the TRL 

10 gene causes the TRL gene to be under-expressed or over-expressed and situations in 

which such mutations result in a non-functional TRL protein or a protein which does not 
function in a wild-type fashion. 

The assays described herein, such as the preceding diagnostic assays or the 
following assays, can be utilized to identify a subject having or at risk of developing a 

15 disorder associated with TRL protein, nucleic acid expression or activity such as diabetes 
or cancer. Alternatively, the prognostic assays can be utilized to identify a subject 
having or at risk for developing diabetes or cancer. Thus, the present invention provides 
a method for identifying a disease or disorder associated with aberrant TRL expression 
or activity in which a test sample is obtained from a subject and TRL protein or nucleic 

20 acid (e.g, mRNA, genomic DNA) is detected, wherein the presence of TRL protein or 
nucleic acid is diagnostic for a subject having or at risk of developing a disease or 
disorder associated with aberrant TRL expression or activity. As used herein, a "test 
sample" refers to a biological sample obtained from a subject of interest. For example, a 
test sample can be a biological fluid (e.g.. serum), cell sample, or tissue. 

25 Furthermore, the prognostic assays described herein can be used to determine 

whether a subject can be administered an agent (e.g.. an agonist, antagonist, 
peptidomimetic. protein, peptide, nucleic acid, small molecule, or other drug candidate) 
to treat a disease or disorder associated with aberrant TRL expression or activity. For 
example, such methods can be used to determine whether a subject can be effectively 

30 treated with an agent for diabetes or cancer. Thus, the present invention provides 

methods for determining whether a subject can be effectively treated with an agent for a 
disorder associated with aberrant TRL expression or activity in which a test sample is 
obtained and TRL protein or nucleic acid expression or activity is detected (e.g.. wherein 
the abundance of TRL protein or nucleic acid expression or activity is diagnostic for a 

35 subject that can be administered the agent to treat a disorder associated with aberrant 
TRL expression or activity.) 
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The methods of the invention can also be used to detect genetic alterations in a 
TRL gene, thereby determining if a subject with the altered gene is at risk for a disorder 
characterized by aberrant cell metabolism, proliferation and/or differentiation. In 
preferred embodiments, the methods include detecting, in a sample of cells from the 
5 subject, the presence or absence of a genetic alteration characterized by at least one of an 
alteration affecting the integrity of a gene encoding a TRL-protein. or the mis-expression 
of the TRL gene. For example, such genetic alterations can be detected by ascertaining 
the existence of at least one of 1) a deletion of one or more nucleotides from a TRL gene; 
2) an addition of one or more nucleotides to a TRL gene; 3) a substitution of one or more 

10 nucleotides of a TRL gene. 4) a chromosomal rearrangement of a TRL gene; 5) an 
alteration in the level of a messenger RNA transcript of a TRL gene. 6) aberrant 
modification of a TRL gene, such as of the methylation pattern of the genomic DNA, 7) 
the presence of a non-wild type splicing pattern of a messenger RNA transcript of a TRL 
gene, 8) a non-wild type level of a TRL-protein, 9) allelic loss of a TRL gene, and 10) 

15 inappropriate post-translational modification of a TRL-protein. As described herein, 
there are a large number of assay techniques known in the art which can be used for 
detecting alterations in a TRL gene. A preferred biological sample is a tissue or serum 
sample isolated by conventional means from a subject. 

In certain embodiments, detection of the alteration involves the use of a 

20 probe/primer in a polymerase chain reaction (PCR) (see, e.g. U.S. Patent Nos. 4.683,195 
and 4.683.202), such as anchor PCR or RACE PCR. or. alternatively, in a ligation chain 
reaction (LCR) (see, e.g., Landegran et al. (1988) Science 241:1077-1080; and 
Nakazawa et al. (1994) PNAS 91 :360-364). the latter of which can be particularly useful 
for detecting point mutations in the TRL-gene (see Abravaya et al. (1995) Nucleic Acids 

25 Res .23:675-682). This method can include the steps of collecting a sample of cells from 
a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the 
sample, contacting the nucleic acid sample with one or more primers which specifically 
hybridize to a TRL gene under conditions such that hybridization and amplification of 
the TRL-gene (if present) occurs, and detecting the presence or absence of an 

30 amplification product, or detecting the size of the amplification product and comparing 
the length to a control sample. It is anticipated that PCR and/or LCR may be desirable to 
use as a preliminary amplification step in conjunction with any of the techniques used 
for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 

35 (Guatelli. J.C et al. 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh. D.Y. et al. 1989. Proc. Natl. Acad. Sci. USA 86:1 173- 
1 177), Q-Beta Replicase (Lizardi, P.M. et all. 1988, Bio/Technology 6:1 197), or any 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID:<WO 9915663A1> 



WO 99/15663 



PCT/US98/202I9 



-67- 

other nucleic acid amplification method, followed by the detection of the amplified 
molecules using techniques well known to those of skill in the art. These detection 
schemes are especially useful for the detection of nucleic acid molecules if such 
molecules are present in very low numbers. 
5 In an alternative embodiment, mutations in a TRL gene from a sample cell can be 

identified by alterations in restriction enzyme cleavage patterns. For example, sample 
and control DNA is isolated, amplified (optionally), digested with one or more 
restriction endonucleases. and fragment length sizes are determined by gel 
electrophoresis and compared. Differences in fragment length sizes between sample and 
10 control DNA indicates mutations in the sample DNA. Moreover, the use of sequence 

specific ribozymes (see. for example. U.S. Patent No. 5,498.531) can be used to score for 
the presence of specific mutations by development or loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations in TRL can be identified by hybridizing 
a sample and control nucleic acids, e.g., DNA or RNA. to high density arrays containing 

15 hundreds or thousands of oligonucleotides probes (Cronin. M.T. et al (1996) Human 
Mutation 7: 244-255; KozaL M.J. et al. (1996) Nature Medicine 2: 753-759). For 
example, genetic mutations in TRL can be identified in two dimensional arrays 
containing light-generated DNA probes as described in Cronin, M.T. et aL supra. 
Briefly, a first hybridization array of probes can be used to scan through long stretches of 

20 DNA in a sample and control to identify base changes between the sequences by making 
linear arrays of sequential ovelapping probes. This step allows the identification of point 
mutations. This step is followed by a second hybridization array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays * 
complementary to all variants or mutations detected. Each mutation array is composed 

25 of parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the 
an can be used to directly sequence the TRL gene and detect mutations by comparing the 
sequence of the sample TRL with the corresponding wild-type (control) sequence. 

30 Examples of sequencing reactions include those based on techniques developed by 

Maxim and Gilbert ((1977) PNAS 74:560) or Sanger ((1977) PNAS 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be utilized 
when performing the diagnostic assays ((1995) Biotechniques 19:448), including 
sequencing by mass spectrometry (see. e.g., PCT International Publication No. WO 

35 94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and Griffin et al. (1993) 
Appl. Biochem. Biotechnol 38:147-159). 
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Other methods for detecting mutations in the TRL gene include methods in 
which protection from cleavage agents is used to detect mismatched bases in RN A/RN A 
or RNA/DNA heteroduplexes (Myers et al. ( 1 985) Science 230: 1 242). In general, the art 
technique of "mismatch cleavage" starts by providing heteroduplexes of formed by 
5 hybridizing (labeled) RNA or DNA containing the wild-type TRL sequence with 

potentially mutant RNA or DNA obtained from a tissue sample. The double-stranded 
duplexes are treated with an agent which cleaves single-stranded regions of the duplex 
such as which will exist due to basepair mismatches between the control and sample 
strands. For instance. RNA/DNA duplexes can be treated with RNase and DNA/DNA 

10 hybrids treated with SI nuclease to enzymatically digesting the mismatched regions. In 
other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 
regions. After digestion of the mismatched regions, the resulting material is then 
separated by size on denaturing polyacrylamide gels to determine the site of mutation. 

15 See. for example. Cotton et aL (1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al. 
(1992) Methods EnzymoL 217:286-295. In a preferred embodiment, the control DNA or 
RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or 
more proteins that recognize mismatched base pairs in double-stranded DNA (so called 

20 ifc DNA mismatch repair* enzymes) in defined systems for detecting and mapping point 
mutations in TRL cDNAs obtained from samples of cells. For example, the mutY 
enzyme of E. coli cleaves A at G/A mismatches and the thymidine DNA glycosylase 
from HeLa cells cleaves T at G/T mismatches (Hsu et al. ( 1 994) Carcinogenesis 
15:1657-1662). According to an exemplary embodiment, a probe based on a TRL 

25 sequence, e.g.. a wild-type TRL sequence, is hybridized to a cDNA or other DNA 

product from a test cell(s). The duplex is treated with a DNA mismatch repair enzyme, 
and the cleavage products, if any, can be detected from electrophoresis protocols or the 
like. See. for example, U.S. Patent No. 5.459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to 

30 identify mutations in TRL genes. For example, single strand conformation 

polymorphism (SSCP) may be used to detect differences in electrophoretic mobility 
between mutant and wild type nucleic acids (Orita et aL (1989) Proc Natl. Acad. Sci 
USA: 86:2766. see also Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) 
Genet Anal Tech Appl 9:73-79). Single-stranded DNA fragments of sample and control 

35 TRL nucleic acids will be denatured and allowed to renature. The secondary structure of 
single-stranded nucleic acids varies according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a single base change. The DNA 
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fragments may be labeled or detected with labeled probes. The sensitivity of the assay 
may be enhanced by using RNA (rather than DNA), in which the secondary structure is 
more sensitive to a change in sequence. In a preferred embodiment, the subject method 
utilizes heteroduplex analysis to separate double stranded heteroduplex molecules on the 
5 basis of changes in electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5). 

In yet another embodiment the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al. (1985) Nature 313:495). When 
DGGE is used as the method of analysis. DNA will be modified to insure that it does not 

10 completely denature, for example by adding a GC clamp of approximately 40 bp of high- 
melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences in the mobility of control and 
sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, but are not 

15 limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
which permit hybridization only if a perfect match is found (Saiki et al. (1986) Nature 
324:163); Saiki et al (1989) Proc. Natl Acad. Sci USA 86:6230). Such allele specific 

20 oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. 

25 Oligonucleotides used as primers for specific amplification may cany the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization) (Gibbs et aL (1989) Nucleic Acids Res. 17:2437-2448) or at the extreme 3' 
end of one primer where, under appropriate conditions, mismatch can prevent or reduce 
polymerase extension (Prossner (1993) Tibtech 1 1:238). In addition it may be desirable 

30 to introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection (Gasparini et aL (1992) Mol. Cell Probes 6: 1). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for amplification 
(Barany (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, ligation will occur only 
if there is a perfect match at the 3' end of the 5' sequence making it possible to detect the 

35 presence of a known mutation at a specific site by looking for the presence or absence of 
amplification. 
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The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving a TRL 
5 gene. 

Furthermore, any cell type or tissue in which TRL is expressed may be utilized in 
the prognostic assays described herein. 

3. Monitoring of Effects During Clinical Trials 

10 Monitoring the influence of agents (e.g.. drugs, compounds) on the expression or 

activity of a TRL protein (e.g., modulation an inflammatory response) an be applied not 
only in basic drug screening, but also in clinical trials. For example, the effectiveness of 
an agent determined by a screening assay as described herein to increase TRL gene 
expression, protein levels, or upregulate TRL activity, can be monitored in clinical trails 

15 of subjects exhibiting decreased TRL gene expression, protein levels, or downregulated 
TRL activity. Alternatively, the effectiveness of an agent determined by a screening 
assay to decrease TRL gene expression, protein levels, or downregulate TRL activity, 
can be monitored in clinical trails of subjects exhibiting increased TRL gene expression, 
protein levels, or upregulated TRL activity. In such clinical trials, the expression or 

20 activity of a TRL gene, and preferably, other genes that have been implicated in. for 

example, a proliferative disorder can be used as a "read out" or markers of the phenotype 
of a particular cell. 

For example, and not by way of limitation, genes, including TRL. that are 
modulated in cells by treatment with an agent (e.g., compound, drug or small molecule) 

25 which modulates TRL activity (e.g., identified in a screening assay as described herein) 
can be identified. Thus, to study the effect of agents on proliferative disorders, for 
example, in a clinical trial, cells can be isolated and RN A prepared and analyzed for the 
levels of expression of TRL and other genes implicated in the proliferative disorder, 
respectively. The levels of gene expression (i.e., a gene expression pattern) can be 

30 quantified by Northern blot analysis or RT-PCR. as described herein, or alternatively by 
measuring the amount* of protein produced, by one of the methods as described herein, or 
by measuring the levels of activity of TRL or other genes. In this way, the gene 
expression pattern can serve as a marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may be determined before, and at 

35 various points during treatment of the individual with the agent. 

In a preferred embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist. 
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antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate identified by the screening assays described herein) comprising the steps of (i) 
obtaining a pre-administration sample from a subject prior to administration of the agent; 
(ii) detecting the level of expression of a TRL protein, mRNA. or genomic DNA in the 
5 preadministration sample; (iii) obtaining one or more post-administration samples from 
the subject; (rv) detecting the level of expression or activity of the TRL protein, mRNA, 
or genomic DNA in the post-administration samples; (v) comparing the level of 
expression or activity of the TRL protein. mRNA ? or genomic DNA in the pre- 
administration sample with the TRL protein, mRNA, or genomic DNA in the post 

10 administration sample or samples; and (vi) altering the administration of the agent to the 
subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of TRL to higher levels than detected, i.e., 
to increase the effectiveness of the agent. Alternatively, decreased administration of the 
agent may be desirable to decrease expression or activity of TRL to lower levels than 

15 detected, i.e. to decrease the effectiveness of the agent. According to such an 

embodiment. TRL expression or activity may be used as an indicator of the effectiveness 
of an agent, even in the absence of an observable phenotypic response. 

C. Methods of Treatment 

20 The present invention provides for both prophylactic and therapeutic methods of 

treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant TRL expression or activity. With regards to both prophylactic and 
therapeutic methods of treatment, such treatments may be specifically tailored or 
modified, based on knowledge obtained from the field of pharmacogenomics. 

25 "Pharmacogenomics", as used herein, refers to the application of genomics technologies 
such as gene sequencing, statistical genetics, and gene expression analysis to drugs in 
clinical development and on the market. More specifically, the term refers the study of 
how a patient's genes determine his or her response to a drug (e.g., a patient's "drug 
response phenotype", or "drug response genotype".) Thus, another aspect of the 

30 invention provides methods for tailoring an individual's prophylactic or therapeutic 
treatment with either the TRL molecules of the present invention or TRL modulators 
according to that individual's drug response genotype. Pharmacogenomics allows a 
clinician or physician to target prophylactic or therapeutic treatments to patients who 
will most benefit from the treatment and to avoid treatment of patients who will 

35 experience toxic drug-related side effects. 



BNSDOCID: <WO 9915663A1> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT/US98/20219 



- 72 - 

1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 
disease or condition associated with an aberrant TRL expression or activity, by 
administering to the subject a TRL or an agent which modulates TRL expression or at 
5 least one TRL activity. Subjects at risk for a disease which is caused or contributed to 
by aberrant TRL expression or activity can be identified by, for example, any or a 
combination of diagnostic or prognostic assays as described herein. Administration of a 
prophylactic agent can occur prior to the manifestation of symptoms characteristic of the 
TRL aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in 
1 0 its progression. Depending on the type of TRL aberrancy, for example, a TRL. TRL 
agonist or TRL antagonist agent can be used for treating the subject. The appropriate 
agent can be determined based on screening assays described herein. The prophylactic 
methods of the present invention are further discussed in the following subsections. 

15 2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating TRL 
expression or activity for therapeutic purposes. Accordingly, in an exemplary 
embodiment, the modulatory method of the invention involves contacting a cell with a 
TRL or agent that modulates one or more of the activities of TRL protein activity 

20 associated with the cell. An agent that modulates TRL protein activity can be an agent 
as described herein, such as a nucleic acid or a protein, a naturally-occurring target 
molecule of a TRL protein (e.g., a naturally-occurring cognate ligand of a TRL protein), 
a TRL antibody, a TRL agonist or antagonist, a peptidomimetic of a TRL agonist or 
antagonist, or other small molecule. In one embodiment, the agent stimulates one or 

25 more TRL protein activity. Examples of such stimulatory agents include active TRL 

protein and a nucleic acid molecule encoding TRL that has been introduced into the cell. 
In another embodiment, the agent inhibits one or more TRL protein activity. Examples 
of such inhibitory agents include antisense TRL nucleic acid molecules and anti-TRL 
antibodies. These modulatory methods can be performed in vitro (e.g., by culturing the 

30 cell with the agent) or, alternatively, in vivo (e,g, by administering the agent to a 
subject). As such, the present invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of a 
TRL protein or nucleic acid molecule. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a screening assay described herein). 

35 or combination of agents that modulates (e.g., upregulates or downregulates) TRL 
expression or activity. In another embodiment, the method involves administering a 
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TRL protein or nucleic acid molecule as therapy to compensate for reduced or aberrant 
TRL expression or activity. 

Stimulation of TRL activity is desirable in situations in which TRL is abnormally 
downregulated and/or in which increased TRL activity is likely to have a beneficial 
5 effect. One example of such a situation is where a subject has a disorder characterized 
by abherrent cell proliferation (e.g. cancer). Another example of such a situation is 
where the subject has a inflammaroty disease (e.g. arthritis). 

3- Pharmacogenomics 

10 The TRL molecules of the present invention, as well as agents, or modulators 

which have a stimulatory or inhibitory effect on TRL activity (e.g., TRL gene 
expression) as identified by a screening assay described herein can be administered to 
individuals to treat (prophylactically or therapeutically) disorders (e.g. diabetes or 
cancer, for example, pancreatic cancer) associated with aberrant TRL activity. In 

15 conjunction with such treatment, pharmacogenomics (i.e.. the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound 
or drug) may be considered. Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, a physician or clinician may 

20 consider applying knowledge obtained in relevant pharmacogenomics studies in 
determining whether to administer a TRL molecule or TRL modulator as well as 
tailoring the dosage and/or therapeutic regimen of treatment with a TRL molecule or 
TRL modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
25 response to drugs due to altered drug disposition and abnormal action in affected 

persons. See e.g., Eichelbaum, M., Clin Exp Pharmacol Physiol, 1996, 23(10-1 1) :983- 
985 and Linder. M.W., Clin Chem, 1997. 43(2):254-266. In general, two types of 
pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 
single factor altering the way drugs act on the body (altered drug action) or genetic 
30 conditions transmitted as single factors altering the way the body acts on drugs (altered 
drug metabolism). These pharmacogenetic conditions can occur either as rare genetic 
defects or as naturally-occurring polymorphisms. For example, glucose-6-phosphate 
dehydrogenase deficiency (G6PD) is a common inherited enzymopathy in which the 
main clinical complication is haemolysis after ingestion of oxidant drugs (anti-malarials. 
35 sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

One pharmacogenomics approach to identifying genes that predict drug response, 
known as "a genome-wide association", relies primarily on a high-resolution map of the 
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human genome consisting of already known gene-related markers (e.g., a "bi-allelic" 
gene marker map which consists of 60.000-100,000 polymorphic or variable sites on the 
human genome, each of which has two variants.) Such a high-resolution genetic map 
can be compared to a map of the genome of each of a statistically significant number of 
5 patients taking part in a Phase Will drug trial to identify markers associated with a 
particular observed drug response or side effect. Alternatively, such a high resolution 
map can be generated from a combination of some ten-million known single nucleotide 
polymorphisms (SNPs) in the human genome. As used herein, a "SNP" is a common 
alteration that occurs in a single nucleotide base in a stretch of DNA. For example, a 

10 SNP may occur once per every 1000 bases of DNA. A SNP may be involved in a 
disease process, however, the vast majority may not be disease-associated. Given a 
genetic map based on the occurrence of such SNPs. individuals can be grouped into 
genetic categories depending on a particular pattern of SNPs in their individual genome. 
In such a manner, treatment regimens can be tailored to groups of genetically similar 

15 individuals, taking into account traits that may be common among such genetically 
similar individuals. 

Alternatively, a method termed the "candidate gene approach", can be utilized to 
identify genes that predict drug response. According to this method, if a gene that 
encodes a drugs target is known (e.g., a TRL protein or TRL receptor of the present 

20 invention), all common variants of that gene can be fairly easily identified in the 

population and it can be determined if having one version of the gene versus another is 
associated with a particular drug response. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 

25 genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 

30 metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for C YP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 
experience exaggerated drug response and side effects when they receive standard doses. 

35 If a metabolite is the active therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated by its C YP2D6-formed 
metabolite morphine. The other extreme are the so called ultra-rapid metabolizers who 
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do not respond to standard doses. Recently, the molecular basis of ultra-rapid 
metabolism has been identified to be due to CYP2D6 gene amplification. 

Alternatively, a method termed the "gene expression profiling", can be utilized to 
identify genes that predict drug response. For example, the gene expression of an animal 
dosed with a drug (e.g., a TRL molecule or TRL modulator of the present invention) can 
give an indication whether gene pathways related to toxicity have been turned on. 

Information generated from more than one of the above pharmacogenomics 
approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. This knowledge, when applied to 
dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with a TRL 
molecule or TRL modulator, such as a modulator identified by one of the exemplary 
screening assays described herein. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are incorporated herein by reference. 

EXAMPLES 

20 Example 1: Isolation And Characterization Of Murine And Human TRL cDNAs 

In this example, the isolation and characterization of the genes encoding murine 
and human TRL (also referred to as "TANGO 75 ") is described. 

Isolation of the murine TRL cDNA 

25 Poly A+ RNA from primary murine megakaryocyte cells was used to construct a 

cDNA library. The cDNA library was constructed by first and second strand synthesis 
as recommended by the manufacturer for the Gibco BRL kit Superscript Plasmid 
System for cDNA Synthesis and Plasmid Cloning (Gibco/BRL; Bethesda MD). cDNAs 
were ligated into the pMET vector and subject to hightroughput random sequencing 

30 using automated fluorescent dideoxy nucleotide sequencing and dye primer chemistry 
(Applied Biosystems Inc., Foster City CA). 

Isolation of the human TRL cDNA 

The human gene was found through homology with the murine gene. A 
35 GenBank™ search of the dbEST database utilizing a mouse megakaryocyte partial TRL 
cDNA (corresponding to the first 402 nucleotides of SEQ ID NO:l) revealed a human 
cDNA clone with between 80-89% nucleotide identity to the murine partial cDNA. This 
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452bp EST (homosapien clone 280262 3 VEST N49208) represents a reversed clone and 
was not annotated. BlastX searching (BLAST™ searching utilizing a nucleotide 
sequence against a protein database) using this sequence revealed no proteins having 
obvious homology. The homosapien clone 280262 was purchased from Research 
Genetics (Huntsville, AL) as part of the IMAGE Consortium. Both the human and 
mouse clone were fully sequenced (SEQ ID NO: 1 and SEQ ID NO:3) and represent 
alternate splice forms of the same gene. 

A GenBank™ search using the human TRL I nucleotide sequence of SEQ ID 
NO:3 revealed EST N49208. which is 452 nucleotides in length, is 97% identical to 
nucleotides 23-452 of the human TRL I nucleotide sequence depicted in Figure 2 and 
SEQ ID NO:3. The reverse complement of EST N50261. which is 438 nucleotides in 
length, is 99% identical to nucleotides 2161-2598 of the human TRL I nucleotide 
sequence which corresponds to the 3' untranslated region of the gene. 

A GenBank™ search using the murine TRL nucleotide sequence of SEQ ID 
NO:l revealed eight EST sequences, four human AA351536, D59902. AA357231. and 
AA374471) and four mouse (AA239755, AA271351. AA072902. and R74815) which 
were similar to different regions of the nucleotide sequence of SEQ ID NO: 1 . As no 
reading frame can be determined from an EST (such as an EST identified in the above 
database searches), an amino acid sequence encoded by the EST cannot be determined. 

Example 2: Distribution Of TRL mRNA In Mouse And Human Tissues 

Northern Blot Analysis 

An - 1 .2kb EcoRI/XhoI fragment from human TRL (that corresponds to the open 
reading frame) was used as a probe for Northern blots. The fragment was labeled using 
the Prime It kit from Stratagene (La Jolla, CA) and then hybridized to multi-tissue 
northern blots from Clontech (Palo Alto, CA) as recommended by the manufacturer. In 
human an approximate 4.2kb transcript was detected in most tissues but was more 
prominently expressed in kidney, brain, placenta and colon. In mouse, a 4.2kb transcript 
was also found in most tissues and was most prominently expressed in kidney and brain. 

In situ Analysis 

For detection of TRL mRNA, in situ, hybridization was performed on various 
sections from mouse, human, and monkey tissues. Sections and smears were postfixed 
with 4% formaldehyde in DEPC-treated IX phosphate-buffered saline (PBS) at room 
temperature for 10 min before being rinsed twice in DEPC-treated PBS and once in 0.1 
M triethanolamine-HCl (pH 8.0). Following incubation in 0.25% acetic anhydride-0.1 
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M triethanolamine-HCl for 10 min, sections were rinsed in DEPC 2X SSC (IX SSC is 
0.1 5M NaCl. 0.0 15M sodium citrate). Tissues were dehydrated through a series of 
ethanol washes, incubated in 100% chloroform for 5 min ? rinsed in 100% ethanol for 1 
min followed by 95% ethanol for 1 min and allowed to air dry. 
5 The hybridization was performed using a 35 S-radiolabeled cRNA probe 

coresponding to nucleotides 406 to 930 (amino acids 73-247) of human TRL II. 
Tissues were incubated with probe (approximately 5 X 10 7 cpm/ml) in the presence of a 
solution containing 600 mM NaCL 10 mM Tris (pH 7.5), 1 mM EDTA, 0.01% sheared 
salmon sperm DNA. 0.01% yeast tRNA r 0.05% yeast total RNA type XI, 1 X 

10 Denhardf s solution. 50% formamide, 10% dextran sulfate. 100 mM dithiothreitoL 0.1% 
sodium dodecyl sulfate (SDS), and 0.1% sodium thiosulfate for 18 h at 55°C. 

After hybridization, slides were washed with 2 X SSC. Sections were then 
sequentially incubated at 37°C in TNE (a solution containing 10 mM Tris-HCl (pH 7.6), 
500 mM NaCl. and 1 mM EDTA), for 10 min, in TNE with lOjig of RNase A per ml for 

15 30 min, and finally in TNE for 10 min. Slides were then rinsed with 2 X SSC at room 
temp, washed with 2 X SSC at 50°C for 1 h ? washed with 0.2 X SSC at 55°C for 1 h, and 
0.2 X SSC at 60°C for 1 h. Sections were then dehydrated rapidly through serial 
ethanol-0.3 M sodium acetate concentrations before being air dried and exposed to 
Kodak Biomax MR scientific imaging film. 

20 

The following tissues expressed TRL: 
Adult Mouse Expression 

brain; cortex-neuronal expression (no glial cell expression) 
25 cerebellum-purkinje cells (no granule cell expression) 

hippocampus-CAl. CA2. possibly CA3 (no dentate gyrus expression) 
brainstem 
bladder: epithelial cell layer 
transverse colon 
30 descending colon 

colon (paraffin): goblet cells 

kidney (signal seen in the cortex but not the medulla): proximal convoluted tubules, 

not in glomeruli or medulla 
stomach: glandular region 

35 
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Embryonic Mouse Expression 

brain liver (signal decreases significantly b/w E14.5 and PL5) 

trigeminal ganglion kidney 
retina (photoreceptor layer) lung 
5 olfactory epithelium intestines 

spinal cord stomach:glandular region 

submaxillary gland 

Adult Human Expression 

10 human pancreas: multifocal signal as seen in monkey pancreas consistent with islets as 
most likely source of signal. 

human tonsil: strong signal in a pattern suggestive of stratified squamous epithelial cell 

layer (pharyngeal mucosa). 

human brain (section of cortex): no signal 
15 human spleen: no signal 

human skeletal muscle: no signal 

human small intestine: signal as observed in previous experiment and determined to be 

the Peyers patches (specific cell type to be determined), 
human colon: expression in surface mucosal epithelium (may be in mucus). 
20 human kidney - medulla 

Adult Monkey Expression 

monkey stomach: strong signal observed in the mucosal layer 

monkey pancreas: multifocal signal as observed in previous experiment and determined 
25 to be the islets (specific cell type of the islets yet to be determined). 

Example 3: Chromosomal Mapping Of The Murine And Human TRL Genes 

This Example describes the chromosomal mapping of the human and murine 
TRL genes. 



30 



35 



Chromosomal Mapping of the human TRL Gene 

Oligos used: 
T75F2 

5' - tccctgacaacacaagctca (SEQ ID NO: 14) 
T75R3 

5' - tccatttctcccggttctg (SEQ ID NO: 15) 
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A number of oligonucleotide primers were designed unique to the human 3 ? UTR, 
which allowed mapping using the Stanford Human Genome Center's G3 radiation hybrid 
panel, and the Whitehead Institute/MIT Center for Genome Research's GENEBRIDGE 4 
radiation hybrid panel (Research Genetics. Huntsville, AL). These maps were built with 
5 STSs, ESTs, polymorphic markers and genes. The approximate location of the human 
TRL gene was determined to be on Chromosome 6p25 by Stanford panel. The LOD 
score from the Whitehead panel was not as good as the Stanford panel. 

An aliquot of DNA (lOul) from the radiation hybrid panels was amplified for 30 
cycles on an MJ Research thermocycler in a 20ul reaction containing: 500ng primers. 
10 2.5U Taq (Perkin Elmer) and final buffer concentrations of .2mM dNTPs, 1.5mM 
MgCb. (PCR Program: Step 1 95°C 2min., Step 2 95°C lmin.. Step 3 60°C lmin. ? 
Step 4 72°C lmin.. Goto Step 2 a 29 times. 4°C indefinite.) PCR products were 
analyzed by electrophoresis through 1.5% agarose gels. 

15 Chromosomal Mapping of the murine TRL Gene 

The TRL gene was mapped utilizing the Mus spretus/C57BL/6J backcross. T75 
is located 12cM distal of D17MIT48 and 20cM proximal of D17MIT9. This region is 
syntenic to 6p21 in human. PCR primers were used to amplify mouse genomic DNA 
using standard techniques. Primers were designed from noncoding sequences of murine 
20 T75 and were as follows: 

Forward Primer: 

5' CC ATGTTGACCGTACCACS' (SEQIDNO:16) 



25 Reverse Primer: 

5' GCACTCTCGGTCAGTCAA3' (SEQIDNO:17) 

Amplification conditions were 35 cycles at 95°C for 40 seconds, 55°C for 50 seconds 
and 72°C for 50 seconds. Samples were run on denaturing 10% SSCP gel at 3W and 4° 
30 C for 16 hours. 

Example 4: Characterization Of TRL Proteins 

In this example, the predicted amino acid sequnces of the murine and human 
TRL proteins were compared to amino acid sequences of known proteins and various 
35 motifs were identified. In addition, the hydrophobicity of the murine and human TRL 
proteins were predicted. 
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The murine TRL cDNA encodes a protein of 573 amino acids (predicted MW of 
63kDa ? not including post-translational modifications), with a predicted transmembrane 
sequence (aa270-288). The human TRL cDNA encodes a 253 amino acid protein with a 
predicted molecular weight of 27kDa (not including post-translational modifications). A 
signal peptide is predicted to exist from aa 1-4L using the prediction program SIGN ALP 
(Henrik Nielsen, Jacob EngelbrechL Soren Brunak and Gunnar von Heijne 
"Identification of prokaryotic and eukaryotic signal peptides and prediction of their 
cleavage sites." (1997) Protein Engineering 10. 1-6). The human TRL I protein appears 
to be secreted and there is no evidence of a transmembrane domain. 

Alignment of the murine and human TRL proteins using GAP alignment tool of 
the GCG package (Genetic Computer Group, Madison, WI) reveals that the two proteins 
are 81% identical (86% similarity) over the amino acids encoding the cysteine-rich 
domains (mouse aa 1-171; human aa 83-253). At the nucleotide level the two cDNA's 
are 64.5% identical, although there are two regions of high homology (approx 87%) 
corresponding to nucleotides 191-842 in mouse (nt 283-934 in human) and nucleotides 
890-1602 (nt 942-1654 in human). 

It is anticipated that both splice forms (soluble and membrane bound) of the 
protein are present in human and mouse. Both proteins contain the cysteine patterning 
characteristic to members of the TNFR family. 

Example 5: Preparation Of TRL Protein. 

Recombinant TRL was produced in Pichia pastoris. A form lacking the signal 
sequence (aa 42-253) was cloned into pPicZ (Invitrogen. San Diego. CA) in frame with a 
yeast signal sequence using the following primers; 

t75R15p; 5* TTTTGAATTCCAGCCAGAACAGAAGGCCTCGA 3' (SEQ ID NO:18) 
and 

t75xba3p; 5* TTTTTCTAGATACCTTTGGTCTTTGGGAAC 3* (SEQ ID NO: 19) 

Transformation, expression and purification of the recombinant protein was then 
performed essentially'as described by the manufacturer (Invitrogen, San Diego. CA). 

In another example the full open reading frame (ORE) of human TRL I (aa 1- 
253) was expressed as an Fc fusion in pCD5. 

A further example is the over-production of full length human TRL in vivo 
mediated by retroviral infection. The sequence for human TRL (aa 1-253) was amplified 
using the following primers; 
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T75pwzf; 5 ? AAAAAAGAATTCGCCGCCATGGGGACCTCT 3 T (SEQ ID NO:20) 
and 

T75pwzr; 5' CTTGTCGTCGTCGTCCTTGTAGTCGTACCTTTG 3 ; (SEQ IDNO:21) 

5 The reverse primer places an epitope tag (Flag sequence) on the 3* end of the protein. 
Amplified products were then subcloned into the retroviral pWZLBlast vector, and 
sequence verified. Bone marrow infected with the retrovirus is then transplanted into 
irradiated mouse recipients and pathology reviewed after 6 weeks. 

10 Example 6: Isolation of Human TRL II cDNA and Characterization of 
Corresponding Protein 

A human Hela cell cDNA library was screened as described in Maniatis, T. 
Molecular Cloning: A Laboratory Manual. 2nd, ed. Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press. Cold Spring Harbor, NY (1989) for alternate 

1 5 splice forms of TRL with a 1 .3 kb fragment encompassing the 5' end of the mouse TRL 
cDNA. A 2.6 kb cDNA clone was isolated and fully sequenced and the nucleotide 
sequence is set forth in SEQ ID NO:22. This clone encodes a protein of 605 amino acids 
with a predicted molecular weight of 66.2 kDa, including signal peptide and without 
posttranslational modifications. The signal peptide is encoded by amino acids 1-41 of 

20 human TRL II (SEQ ID NO:23) and a putatitive transmembrane domain exists between 
amino acids 352 and 370. In the extracellular region of the molecule there are 6 
potential N-linked glycosylation sites centered at amino acids 82. 141. 252. 257, 278 and 
289, A putative death-domain exists (PROSITE: PDOC50017) in the intracellular 
region of the molecule at about amino acids 415-498. There are 3 potential protein 

25 kinase C phosphorylation sites at amino acids 44 L 467 and 506. 

Alignment of murine and human TRL-II proteins using GAP alignment tool of 
the GCG package reveals that the two proteins are 81% identical (84% similar). The 
alignment is set forth below: 

30 Example 7: Apopotic Effect of TRL II Protein. 

To ascertain if the clone containing full-length human TRL II could induce 
apoptotic signal, an over-expression model similar to that described by Kumar et al 
(1994) was used. Briefly the SW480 colorectal carcinoma cell line was plated into 6- 
well tissue culture plates at a density of 4 x 10^ cells/well. The next day cells were 
35 transfected with the reporter gene pSVp (Clontech) and test construct (human TRL II) or 
control construct (pMET) ; using lipofectamine (GIBCO). 36 hours post-transfection the 
cells were fixed and stained for p-gal activity and the percentage of cells expressing B- 
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gal ascertained. The percentage of P-gal positive staining cells in control plates was 
approximately 18.8% whereas in plates transfected with human TRL II. only 9.1% of 
cells that were p-gal positive. This decrease is statistically significant. Additional 
experiments indicate that overexpression of TRL II is capable of inducing cell death in 
SW480 cells. 

Example 8: Construction and Testing of TRL Fusion Proteins to Direct Secretion 
of a Heterologuous Protein 

The human TRL signal sequence (nucleotides 1 87 - 3 16 or SEQ ID NO:3, amino 
acid residues 1 - 42 of SEQ ID NO:4) was attached to the N-terminal end of the mature 
human OB receptor (leptin receptor) extracellular domain fused at the C-terminus to 
human IgG Fc domain. In a parallel experiment, the mouse OB receptor signal sequence 
was attached to the N-terminal end of the mature human OB receptor extracellular 
domain fused at the C-terminus to human IgG Fc domain. 

In particular, the amino acid sequence of the C-terminal Fc portion is as follows: 

DPEEPKSCDKTHTCPPCPAPELLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSH 

EDPEVKFNWYVDGVEVHNAKTKPREEQTNSTYRVVSVLTVLHQDWLNGKEYK 

CKVSNKALPAPIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPS 

DIAVEWESNGQPENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVM 

HEALHNHYTQKSLSLSPGK (SEQ ID NO:32) 

The amino acid sequence of the human OB receptor (portion) included in all constructs 
is as follows: 

NLSYPITPWRFKLSCMPPNSTYDYFLLPAGLSKNTSNSNGHYETAVEPKFNSSGT 

HFSNLSKTTFHCCFRSEQDRNCSLCADNIEGKTFVSTVNSLVFQQIDANWNIQC 

WLKGDLKLFICYVESLFKNLFRNYNYKVHLLYVLPEVLEDSPLVPQKGSFQMV 

HCNCSVHECCECLVPVPTAKLNDTLLMCLKITSGGVIFQSPLMSVQPINMVKPDP 

PLGLHMEITDDGNLKISWSSPPLVPFPLQYQVKYSENSTTVIREADKIVSATSLLV 

DSILPGSSYEVQVRGKRLDGPGIWSDWSTPRVFTTQDVIYFPPKILTSVGSNVSFH 

CIYKKENKIVPSKEIVWWMNLAEKIPQSQYDVVSDHVSKVTFFNLNETKPRGKF 

TYDAVYCCNEHECHHRYAELYVIDVNINISCETDGYLTKMTCRWSTSTIQSLAE 

STLQLRYHRSSLYCSDIPSIHPISEPKDCYLQSDGFYECIFQPIFLLSGYTMWIRINH 

SLGSLDSPPTCVLPDSVVKPLPPSSVKAEITINIGLLKISWEKPVFPENNLQFQIRY 

GLSGKEVQWKMYEVYDAKSKSVSLPVPDLCAVYAVQVRCKRLDGLGYWSNW 

SNPAYTVVMDIKVPMRGPEFWRIINGDTMKKEKNVTLLWKPLMKNDSLCSVQR 

YVINHHTSCNGTWSEDVGNHTKFTFLWTEQAHTVTVLAINSIGASVANFNLTFS 

WPMSKVNIVQSLSAYPLNSSCVIVSWILSPSDYKLMYFIIEWKNLNEDGEIKWLR 
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ISSSVKKYYIHDHFIPIEKYQFSLYPIFMEGVGKPKIINSFTQDDIEKHQ (SEQ ID 
NO:33). 

Constucts were made in the order: signal sequence/hOB-R/Fc. Either a single glycine or 
5 three alanines were used as a linker between the hOB-R and Fc sequences. 

These constructs and the parent construct containing the native human OB 
receptor signal sequence were transiently transfected into HEK 293 cells and supernatant 
were harvested after 72 hours. Equivalent samples were run on SDS PAGE, transferred 
to nylon membranes and western blotted with anti human Fc antibodies. The sample 
10 prepared using the human OB receptor signal sequence gave no detectable band. The 
mouse OB receptor and the human TRL signal peptides gave detectible amounts of 
immunoreactivity which were quantitated by densitometry. The results show that the 
heterologuous TRL signal sequence has elevated the amount of secreted human OB 
receptor protein in the supernatants approximately 10 fold. 
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Equivalents Those skilled in the art will recognize, or be able to ascertain using no 
more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
20 following claims. 
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What is claimed is: 

1 . An isolated nucleic acid molecule selected from the group consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence which is at 
least 60% homologous to a nucleotide sequence of SEQ ID NO:l, SEQ ID NO:3, 
SEQ ID NO:22, the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98544, or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 500 nucleotides 
of a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 1 , SEQ ID 
NO:3. SEQ ID NO:22. the DNA insert of the plasmid deposited with ATCC as 
Accession Number 98544, or a complement thereof: 

c) a nucleic acid molecule which encodes a polypeptide comprising an 
amino acid sequence at least about 60% homologous to the amino acid sequence 
of SEQ ID NO:2, SEQ ID NO:4 ? SEQ ID NO:23 ? or an amino acid sequence 
encoded by the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98544; 

d) a nucleic acid molecule which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:23. or the polypeptide encoded by the DNA insert of the plasmid deposited 
with ATCC as Accession Number 98544. wherein the fragment comprises at 
least 15 contiguous amino acid residues of the amino acid sequence of SEQ ID 
NO:2. SEQ ID NO:4. SEQ ID NO:23. or the polypeptide encoded by the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98544; 

e) a nucleic acid molecule which encodes a naturally occurring allelic 
variant of a polypeptide comprising the amino acid sequence of SEQ ID NO: 2, 
SEQ ID NO:4. SEQ ID NO:23. or an amino acid sequence encoded by the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98544, wherein 
the nucleic acid molecule hybridizes to a nucleic acid molecule comprising SEQ 
ID NO: I, SEQ ID NO:3 ? or SEQ ID NO:24 under stringent conditions; and 

f) a nucleic acid molecule which hybridizes under stringent conditions to a 
nucleotide sequence of SEQ ID NO:l, SEQ ID NO:3 ? SEQ ID NO:22, the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98544, or a 
complement thereof. 
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2. The isolated nucleic acid molecule of claim 1 which is selected from the group 
consisting of: 

a) a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO:l. SEQ ID NO:3, SEQ ID NO:22 ? or the DNA insert of the plasmid 

5 deposited with ATCC as Accession Number 98544. or a complement thereof; 

and 

b) a nucleic acid molecule which encodes a polypeptide comprising the 
amino acid sequence of SEQ ID NO:2 ? SEQ ID NO:4, SEQ ID NO:23, or an 
amino acid sequence encoded by the DNA insert of the plasmid deposited with 

1 0 ATCC as Accession Number 98544, 

3. The nucleic acid molecule of claim 1 further comprising vector nucleic acid 
sequences. 

15 4. The nucleic acid molecule of claim 1 further comprising nucleic acid sequences 
encoding a heterologous polypeptide. 

5. A host cell which contains the nucleic acid molecule of claim 1 . 
20 6. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence of SEQ 
ID NO:2. SEQ ID NO:4. SEQ ID NO:23. or the polypeptide encoded by the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98544, 

25 wherein the fragment comprises at least 1 5 contiguous amino acids of SEQ ID 

NO:2 ? SEQ ID NO:4. SEQ ID NO:23. or the amino acid sequence encoded by the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98544; 

b) a naturally occurring allelic variant of a polypeptide comprising the 
amino acid sequence of SEQ ID NO:2 ? SEQ ID NO:4. SEQ ID NO:23, or an 

30 amino acid sequence encoded by the DNA insert of the plasmid deposited with 

ATCC as Accession Number 98544, wherein the polypeptide is encoded by a 
nucleic acid molecule which hybridizes to a nucleic acid molecule comprising 
SEQ ID NO:K SEQ ID NO:3. or SEQ ID NO:22 under stringent conditions; and 

c) a polypeptide which is encoded by a nucleic acid molecule comprising a 
35 nucleotide sequence which is at least 60% homologous to a nucleic acid 

comprising the nucleotide sequence of SEQ ID NO: 1 . SEQ ID NO:3, SEQ ID 
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NO:22. or the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98544; 

d) a polypeptide which is encoded by a nucleic acid molecule which 
hybridizes under stringent conditions to a nucleotide sequence of SEQ ID NO:L 
SEQ ID NO:3, SEQ ID NO:22 ? or the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98544; and 

e) a polypeptide comprising an amino acid sequence which is at least 60% 
homologous to the amino acid sequence of SEQ ID NO:2. SEQ ID NO:4 ? SEQ 
ID NO:23, or the polypeptide encoded by the DNA insert of the plasmid 
deposited with ATCC as Accession Number 98544. 

7. The isolated polypeptide of claim 6 comprising the amino acid sequence of SEQ 
ID NO:2, SEQ ID NO:4, SEQ ID NO:23. or an amino acid sequence encoded by the 
DNA insert of the plasmid deposited with ATCC as Accession Number 98544. 

8. The polypeptide of claim 6 further comprising heterologous amino acid 
sequences. 

9. An antibody which selectively binds to a polypeptide of claim 6. 

10. A method for producing a polypeptide selected from the group consisting of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID NO: 2, 
SEQ ID NO:4, SEQ ID NO:23. or an amino acid sequence encoded by the DNA 
insert of the plasmid deposited with ATCC as Accession Number 98544; 

b) a fragment of a polypeptide comprising the amino acid sequence of SEQ 
ID NO:2, SEQ ID NO:4. SEQ ID NO:23. or an amino acid sequence encoded by 
the DNA insert of the plasmid deposited with ATCC as Accession Number 
98544 wherein the fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO:2. SEQ ID NO:4 ? SEQ ID NO:23 ? or the amino acid sequence 
encoded by the DNA insert of the plasmid deposited with ATCC as Accession 
Number 98544; and 

c) a naturally occurring allelic variant of a polypeptide comprising the 
amino acid sequence of SEQ ID NO:2 r SEQ ID NO:4, SEQ ID NO:23, or an 
amino acid sequence encoded by the DNA insert of the plasmid deposited with 
ATCC as Accession Number 98544, wherein the polypeptide is encoded by a 
nucleic acid molecule which hybridizes to a nucleic acid molecule comprising 
SEQ ID NO:l. SEQ ID NO:3. or SEQ ID NO:22 under stringent conditions; 
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comprising culturing the host cell of claim 5 under conditions in which the 
nucleic acid molecule is expressed. 

11. A method for detecting the presence of a polypeptide of claim 6 in a sample 
comprising; 

a) contacting the sample with a compound which selectively binds to the 
polypeptide; and 

b) determining whether the compound binds to the polypeptide in the sample 
to thereby detect the presence of a polypeptide of claim 6 in the sample. 

12. The method of claim 1 L wherein the compound which binds to the polypeptide 
is an antibody. 

13. A kit comprising a compound which selectively binds to a polypeptide of claim 6 
and instructions for use. 

14. A method for detecting the presence of a nucleic acid molecule in claim 1 in a 
sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample to thereby detect the presence of a nucleic 
acid molecule of claim 1 in the sample. ■ ' ' ' 

15. The method of claim 14, wherein the sample comprises mRNA molecules and is 
contacted with a nucleic acid probe. 

16. A kit comprising a compound which selectively hybridizes to a nucleic acid 
molecule of claim 1 and instructions for use. 
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1 7. A method for identifying a compound which binds to a polypeptide of claim 6 
comprising: 

a) contacting the polypeptide, or a cell expressing the polypeptide with a test 
compound; and 

5 b) determining whether the polypeptide binds to the test compound. 

1 8. The method of claim 1 7, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

a) detection of binding by direct detection of test compound/polypeptide 
10 binding; 

b) detection of binding using a competition binding assay; and 

c) detection of binding using an assay for TRL activity. 

19. A method of modulating the activity of a polypeptide of claim 6 comprising 

1 5 contacting the polypeptide or a cell expressing the polypeptide with a compound which 
binds to the polypeptide in a sufficient concentration to modulate the activity of the 
polypeptide. 

20. A method for identifying a compound which modulates the activity of a 
20 polypeptide of claim 6 comprising: 

a) contacting a polypeptide of claim 6 with a test compound; and 

b) determining the effect of the test compound on the activity of the 
polypeptide to thereby identify a compound which modulates the activity 
of the polypeptide. 

25 
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Figure 5 

Sequence Stare End Sequencetype 

huTango75 II Q > 60S) PROTEIN 
huTango75 I (1 > 253) PROTEIN 

muTango 75 {1 > S73) PROTEIN 

1 60 
huTango75 1 1 MGTSPSSSTAIJVSCSRIAilRATATMIAGSLLLLGFLSTTTAOPEO/CASi^raTyfiHVD^ 
huTango7 5 I MGTSPSSSTAIASCSRIAJUUVTATMIAGSLLLLGFLSTTTAGPSOKASAT^IGTy^HVDRA 
muTango 75 M 

61 120 
huTango75 II TG Q VL TCDKCPA G T YVS EH C TNTS L R VCS SCP VG TFTR HENGIEKCHDCSO PC PWPMI EK 
huTango7 5 I TGQVLTCDKC PAGTYVS EHCTNTSLRVCS SCPVGTFTRHENGI EKCHDCSOPCPWPMI EK 
muTango 75 SLRVCS SCPAGTFTRHENGIERCHDCSOPCPWPMIER 

121 180 
huTango75 II L PCAAL TDR EC TCPPGM FQSNA TCA PHTVCPVGWGVRKKGTETEDVRC kQCARGTFSDVP 
huTango7 5 I L PCAAL TDREC TCP PGMFQSNA TCA PHTVCP VGWGVRKKGTETEDVRC KQCARGTFSDVP 

muTango 7 5 LPCAALTDREC ICPPGMYQSNGTCAPHTVCPVGWGVRKKGTENEDVRC KQCARGTFSDVP 

181 240 
huTango75 II SSVMKCKAYTDCLSQNLWIKPGTKETDNVCG V LPS FSSSTS PS PGTA I FPRPRHMFTtiF 
huTango75 I SSVMKCKA Y TDCL S ONL, W I KPGTKETDNVCGT LPSFSSSTS PS PGTA I FPRPEHMFTtiF 

muTango 75 SSVMKCKAHTDCLGQNLEVVKPGTKETDt/VCGM RLFFSSTKPPSSGTVTFS^ 

241 300 
huTango75 II VPSSTYVPKGMNSTESNSSAS VRPKVLSS IQEGTVPDNTSSARGKEDVNKTLPNLQWNH 
huTango75 I VPSSTYVPK 

muTango 7 5 VPSSTYEPQGMNSTDSNSTASVRTKVPSGIEEGTVPDNTSSTSGKEGTNRTLPNPPQVTH 

301 360 
huTango75 II QQGPHHRHILKLLPSMEATGGEKSSTPIKGPKRGHPRQNLHKHFDINEHLP WMIVLFLLL 

huTango75 I _ 

muTango 7 5 QQAPHHRHILKLLPSSMEATGEKSSTAI KAPKRGHPRQNAHKHFDINEHLP WMIVLFLLL 



361 420 
huTango75 II VLVyrvvC^IRKSSRTLKKGPRQDPSAI VEKAGLKKSMTPTQNREKWI YYCNGHGIDILK 

huTango75 I 

muTango 75 VLVLIVVCSIRKSSRTLKKGPRQDPSAIVEKAGLKKS LTPTQNREKWI YYRNGHGIDILK 

421 480 
huTango75 II LVAAQVGSQWKDI YQFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLIS 

huTango75 I DQRY- 

muTango 75 LVAAQVGSQWKDI YQFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLIS 

4 81 540 
huTango75 II ALRQHRRNDWEKIRG LMEDTTQLETDKLALPMSPSPLSPSPI PSPNAKLENSALLTVEP 

huTango75 I _ 

muTango 75 ALRQHRRNDWEKIRG LMEDTTQLETDKLALPMSPSPLSPSPIPSPNVKLENSTLLTVEP 

541 600 

huTango75 II SPQD LL FKWLDNWATKELELHLL - -GFELFW 

huTango75 I __ 

muTango 7 5 SPLDKNKCF FVDESEPLLRCDSTSSGSSALSRNGSFITKEKKDTVLROVRLDPCDLQPIF 

601 655 

huTango75 1 1 NTLLHFGKS- - - KSSASGALSIENLPS- FAJLKDV LFFIYT 

huTango75 I - 

muTango 75 DDMLHI LNPEELRVI EE I PQAEDKLDRLFE I IGVKSQEASQTLLDSVYSHLPDLL 
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Al ignmenc of : 



human TRL I I DD (1 84) PROTEIN 

murine TRL DD (1 > 84) PROTEIN 

1 10 20 30 40 50 * * s : 

human TNFR1 DD PATLYAVVENVPPLRWKEFVRJRLGLSDHEIDRLELONGR CLRKAQ YSMLATWRR 

human FAS DD SKYI TTIAGVMTLSQVXGFVRKNGVNEAKXDEIKNDNVQ DTABQKVQLLRNWHQ 

human TRADD DD SLKDOOTFARSVGLKWRKVGRSLORGCRALRDPAX.DSLAYEYEREGLYBOAFOLLRRFV0 

human FADD DD LCAAFNVICDNVGKDWRRLARQLKVSDTKIDSIEDRYPR NLTERVRESLR IWKN 

human RIP DD TDKHLDPIRENLGKHWKNCARKXGFTQSQrDEIDHDYERD GLKBKVYQMLQKWVM 

human TRL II DD G IDI LKLVAAQVGSQWKDIYQFLCNASEREVAAFSNG YTAD - HERAYAALQHWTI 

murine TRL DD G ID I LKLVAAQVGSQWXDI YQFLCNAS ERE VAAFSNG YTAD HERAYAALQHWT I 

70 * * 80 

human TNFR1 DD RTPRREATLELLGRVLRDMDLLGCLBDIEEAL 

human FAS DD LHG- KKEAYDTLI KDl*KXANLCTLAEKIQTI I 

Human TRADD DD AEG - RRATLQRLVEALEENELTSLABDLLGLT 

human FADD DD TEK- ENATVAHLVGALRSCQMNLVADLVQEVQ 

human RIP DD REGI KGATVGKLAQAL.HQCSRIDLLSSLI YVS 

human TRL 1 1 DD R -GP- EASLAQLISALRQHRRNDWBKIRGLM 

murine TRL DD R -GP - EASLAQLISALRQHRRNDWEKIRGLM 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT : 

(A) NAME: MILLENIUM B IOTHERAPEUTICS , INC. 

(B) STREET : 620 MEMORIAL DRIVE 

(C) CITY: CAMBRIDGE 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: US 

(F) POSTAL CODE: 

(G) TELEPHONE: 

(H) TELEFAX: 

(ii) TITLE OF INVENTION: NOVEL MOLECULES OF THE TNF RECEPTOR SUPERFAMILY 

AND USES THEREFOR 

(iii) NUMBER OF SEQUENCES: 2 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHIVE & COCKFIELD , LLP 

(B) STREET: 23 STATE STREET 

(C) CITY: 30STON 

(D) STATE: MASSACHUSETTS 

( E ) COUNTRY : US 

(F) ZIP: 02109 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : PCT/US 98 / 

(B) FILING DATE: 25 SEPTEMBER 1998 

(C) CLASSIFICATION: 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/938,896 

(B) FILING DATE: 2 6 SEPTEMBER 19 97 

5 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/042,785 

(B) FILING DATE: 17 MARCH 1998 

(viii) ATTORNEY /AGENT INFORMATION: 
10 (A) NAME: MANDRAGOURAS , AMY E 

(B) REGISTRATION NUMBER: 36,207 

(C) REFERENCE / DOCKET NUMBER: MEI-001CPPC 

(ix) TELECOMMUNICATION INFORMATION: 
15 (A} TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)742-4214 



20 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
30 (A) NAME /KEY : CDS 

(B) LOCATION: 344.. 2065 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

35 GTCGACCCAC GGTCCGGGAG ACTTACCACC AAGTAGCAGG ATCTTCTCTT TCTCAATTTC 6 0 

CAATATGAAA TTAAATTTCC CAACAAGAAA ACCAACCACT ATCCATCGCC AACCACCTCT 12 0 



BNSDOCID:<WO 9915663A1> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCTAJS98/20219 



GCCCCTACTT TCAGACTCAG AAGGAAGAAA ACTAAGTATA TCGTAAACTC TAAGGAGGAA 18 0 

ACCTCAAGAA CCGCTTGGAT TCCTCAGCAC CATCACAGCT CAACCAGAAC AAAAGACTCT 24 0 

GAGTCTCCCT GGCACCTACC GCCATGTTGA CCGTACCACT GGCCAGGTGC TAACCTGCGA 3 00 



CAAGTGCCCA GCAGGAACGT ATGTCTCCGA GCACTGTACC AAC ATG AGC CTG CGA 35 5 

Met Ser Leu Arg 
1 



GTC TGC AGC AGC TGC CCC GCG GGG ACC TTT ACC AGG CAC GAG AAC GGC 4 03 

Val Cys Ser Ser Cys Pro Ala Gly Thr Phe Thr Arg His Glu Asn Gly 
5 10 15 20 



ATA GAG AGA TGC CAT GAC TGT AGT CAG CCA TGT CCA TGG CCG ATG ATT 451 
lie Glu Arg Cys His Asp Cys Ser Gin Pro Cys Pro Trp Pro Met lie 
25 30 35 



GAG AGA TTA CCT TGT GCT GCC TTG ACT GAC CGA GAG TGC ATC TGC CCA 4 99 

Glu Arg Leu Pro Cys Ala Ala Leu Thr Asp Arg Glu Cys lie Cys Pro 
40 45 50 



CCT GGA ATG TAT CAG TCT AAT GGT ACC TGC GCT CCC CAT ACA GTG TGC 54 7 

Pro Gly Met Tyr Gin Ser Asn Gly Thr Cys Ala Pro His Thr Val Cys 
55 60 65 



CCC GTG GGC TGG GGT GTG CGG AAG AAA GGG ACA GAG AAT GAA GAT GTG 5 95 

Pro Val Gly Trp Gly Val Arg Lys Lys Gly Thr Glu Asn Glu Asp Val 
70 75 80 



CGC TGT AAG CAG TGC GCT CGG GGT ACC TTC TCT GAC GTG CCT TCC AGT 643 
Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe Ser Asp Val Pro Ser Ser 
85 90 95 100 
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GTG ATG AAG TGT AAA GCT CAC ACG GAC TGT CTG GGT CAG AAC GTG GAG 6 91 

Val Met Lys Cys Lys Ala His Thr Asp Cys Leu Gly Gin Asn Leu Glu 
105 110 115 

5 GTG GTC AAG CCA GGG ACC AAG GAG ACA GAC AAC GTC TGT GGC ATG CGC 73 9 

Val Val Lys Pro Gly Thr Lys Glu Thr Asp Asn Val Cys Gly Met Arg 
120 125 130 

CTG TTC TTC TCC AGC ACA AAC CCA CCT TCC TCT GGC ACA GTT ACC TTT 73 7 

10 Leu Phe Phe Ser Ser Thr Asn Pro Pro Ser Ser Gly Thr Val Thr Phe 
135 140 145 

TCT CAC CCT GAG CAT ATG GAA TCC CAC GAT GTC CCT TCC TCC ACC TAT 83 5 

Ser His Pro Glu His Met Glu Ser His Asp Val Pro Ser Ser Thr Tyr 
15 150 155 160 

GAG CCC CAA GGC ATG AAC TCA ACA GAT TCC AAC TCT ACT GCC TCT GTT 3 83 

Glu Pro Gin Gly Met Asn Ser Thr Asp Ser Asn Ser Thr Ala Ser Val 
165 170 175 180 

20 

AGA ACT AAG GTA CCA AGT GGC ATC GAG GAA GGG ACA GTG CCT GAC AAT 931 

Arg Thr Lys Val Pro Ser Gly lie Glu Glu Gly Thr Val Pro Asp Asn 
185 190 195 

25 ACG AGC TCA ACC AGT GGG AAG GAA GGC ACT AAT AGG ACC CTG CCA AAC 979 
Thr Ser Ser Thr Ser Gly Lys Glu Gly Thr Asn Arg Thr Leu Pro Asn 
200 205 210 

CCA CCA CAA GTT ACC CAC CAG CAA GCC CCC CAC CAC AGA CAC ATT CTG 102 7 

30 Pro Pro Gin Val Thr His Gin Gin Ala Pro His His Arg His He Leu 
215 220 225 

AAG CTG CTG CCA TCG TCC ATG GAG GCC ACG GGT GAG AAG TCC AGC ACA 10 7 5 

Lys Leu Leu Pro Ser Ser Met Glu Ala Thr Gly Glu Lys Ser Ser Thr 
35 230 235 240 
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GCC ATC AAG GCC CCC AAG AGG GGT CAC CCC AGA CAG AAC GCT CAC AAG 112 3 

Ala ile.Lys Ala Pro Lys Arg Gly His Pro Arg Gin Asn Ala His Lys 
245 250 255 260 

5 CAT TTC GAC ATC AAC GAG CAC TTG CCT TGG ATG ATC GTC CTC TTC CTT 1171 
His Phe Asp lie Asn Glu His Leu Pro Trp Met lie Val Leu Phe Leu 
265 270 275 

CTG CTG GTC CTG GTG CTG ATA GTG GTG TGC AGT ATC CGA AAG AGC TCC 1219 
10 Leu Leu Val Leu Val Leu He Val Val Cys Ser He Arg Lys Ser Ser 
280 285 290 

AGG ACT CTC AAA AAG GGG CCC CGG CAG GAT CCC AGC GCC ATA GTG GAA 12 6 7 

Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp Pro Ser Ala He Val Glu 
15 295 300 305 

AAG GCG GGG CTG AAG AAG TCC CTG ACT CCC ACC CAG AAC CGG GAG AAA 1315 

Lys Ala Gly Leu Lys Lys Ser Leu Thr Pro Thr Gin Asn Arg Glu Lys 

310 315 320 

20 

TGG ATC TAC TAC CGC AAC GGC CAT GGT ATT GAC ATC TTG AAG CTT GTA 136 3 

Trp He Tyr Tyr Arg Asn Gly His Gly He Asp He Leu Lys Leu Val 

325 330 ■ 335 340 

25 GCA GCC CAG GTG GGA AGC CAG TGG AAG GAC ATC TAT CAG TTT CTT TGC 1411 
Ala Ala Gin Val Gly Ser Gin Trp Lys Asp He Tyr Gin Phe Leu Cys 
345 350 355 

AAC GCC AGT GAG AGG GAG GTG GCG GCC TTC TCC AAT GGA TAC ACT GCA 14 59 

30 Asn Ala Ser Glu Arg Glu Val Ala Ala Phe Ser Asn Gly Tyr Thr Ala 
360 ' 365 370 

GAT CAT GAA CGG GCC TAC GCG GCT CTG CAG CAC TGG ACC ATC CGT GGC 1507 
Asp His Glu Arg Ala Tyr Ala Ala Leu Gin His Trp Thr He Arg Gly 
35 375 380 385 
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CCT GAG GCC AGC CTT GCC CAG CTC ATT AGC GCC TTG CGC CAG CAC CGA 155 5 

Pro Glu Ala Ser Leu Ala Gin Leu He Ser Ala Leu Arg Gin His Arg 
390 395 400 

5 CGC AAT GAT GTT GTG GAG AAG ATT CGT GGG CTG ATG GAA GAC ACC ACG 1603 
Arg Asn Asp Val Val Glu Lys He Arg Gly Leu Met Glu Asp Thr Thr 
405 410 415 420 

CAG TTG GAA ACA GAC AAA CTG GCT CTC CCC ATG AGC CCC AGT CCG CTG 16 51 

10 Gin Leu Glu Thr Asp Lys Leu Ala Leu Pro Met Ser Pro Ser Pro Leu 

425 430 435 

AGC CCG AGC CCC ATC CCC AGT CCT AAC GTG AAA CTT GAG AAT TCC ACT 16 9 9 

Ser Pro Ser Pro He Pro Ser Pro Asn Vai Lys Leu Glu Asn Ser Thr 
15 440 445 450 

CTC CTG ACA GTG GAG CCC TCA CCG CTG GAC AAG AAC AAG TGC TTC TTC 174 7 

Leu Leu Thr Val Glu Pro Ser Pro Leu Asp Lys Asn Lys Cys Phe Phe 
455 460 465 

20 

GTG GAC GAG TCA GAG CCC CTT CTG CGT TGC GAC TCC ACA TCC AGT GGC 17 95 

Val Asp Glu Ser Glu Pro Leu Leu Arg Cys Asp Ser Thr Ser Ser Gly 
470 475 480 

25 TCT TCA GCA CTG AGC AGA AAC GGC TCC TTT ATT ACC AAA GAA AAG AAG 18 4 3 

Ser Ser Ala Leu Ser Arg Asn Gly Ser Phe He Thr Lys Glu Lys Lys 
485 490 495 500 

GAC ACA GTG TTG CGG CAG GTC CGC CTG GAC CCC TGT GAC TTG CAG CCC 18 91 

30 Asp Thr Val Leu Arg Gin Val Arg Leu Asp Pro Cys Asp Leu Gin Pro 

505 ■ 510 515 

ATC TTT GAT GAC ATG CTG CAT ATC CTG AAC CCC GAG GAG CTG CGG GTG 193 9 

He Phe Asp Asp Met Leu His He Leu Asn Pro Glu Glu Leu Arg Val 
35 520 525 530 
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ATT GAA GAG ATT CCC CAG GCT GAG 
lie Glu Glu He Pro Gin Ala Glu 
535 540 

5 ATC ATT GGG GTC AAG AGC CAA GAA 
He lie Gly Val Lys Ser Gin Glu 
550 555 

GTG TAC AGT CAT CTT CCT GAC CTA 
10 Val Tyr Ser His Leu Pro Asp Leu 
565 570 

TTCTGGGAAT CAACCTACTG GCGGGGTGAT 

•15 TGTGTATGTA TGTGTTTAAC AGAGTGTATG 

CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT 

CTTCCTTCCT TCCTTCCTGA AAGTGAATGT 

20 

AAATGCCCAC CACTAAATTT TTTTTAAGTT 
TATATCTTCA ACACTATTCT GTGCACTTTA 
25 CCCATATGCT GGGTTCCGAG ACTCTCAACT 
CTAGAAGTAG ACATAAGTCT TTCAACCTTC 
GCTATTTGTC TTATTGTTTG TGCTTTACAA 

30 

CCTGTTGATG ACTGTAACTC TATTCGACTT 
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GAC AAA CTG GAC CGC CTC TTC GAG 198 7 
Asp Lys Leu Asp Arg Leu Phe Glu 
545 

GCC AGC CAG ACC CTC TTG GAC TCT 2 03 5 
Ala Ser Gin Thr Leu Leu Asp Ser 
560 

TTG TAGAACACAG GGGCACTGCA 2 0 82 
Leu 

TTCATTTCGT TTCTGACTTT TGTGTTTTGG 214 2 

GCCGGTGAGT TTGGGGTTCT TTCTTTCTTT 2 2 02 

TTCTTCCTTC CTTCCTTCCT TCCTTCCTTC 22 62 

ATAAAGCCTT TACAATGTAT AACTGTTGGA 2 3 22 

CCATATATTC TCCATTTTTG CCTTCTTATA 2 3 82 

AAAACTTAAC ATAAACGCAG TGTGACTTCT 2 44 2 

TCTTAAAAAC CTAATGGCAT CTTGTGACTC 2 5 02 

ACACCTACTC TTTCTGTTTT AATTATTATT 2 562 

GCGTTCTTGA GGACGGAGGG AATCTACGAC 2 6 22 

TGAGTTGTCT TCTTCATGTC T TG T TAT AT A 26 82 



GTTCATATTC ATGGCTGAAA CTTGACCATA CTCCCTAGCG CCGCTGATTG TATGGTTTTC 2 742 

35 GTCTGGACAC CGTACACTGC CTGATAACTT GTGCACCTCT TAACGCTACT ATGCTCTGGG 2 802 

CTGGAGAATG AAATCTTTAA GTCACCAGGA CTTGCTGTTT CAGTGGCTTG ACACCTGGGC 2862 
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CACCAAAGAA CTCGATCTTC ATCTTTTAGG GACACCTCTG CTGCACCTTG GAAAGCCAAC 2 92 2 

CTTAAGTGCC AGTGGCACTT TATGCCCAGC TTTGCTTTGA AAGATATCTT TCTTGTTTTT 2 98 2 

5 

TTTTATCCTT CTCTTTCTCT CTTTTTTTTA AAAATACACA TAGTCAATAG GTCCAGTCTG 3 04 2 

CCCTCAAGGC CTTGCTGGGT TTTTTTCGTC ATCCAATCAC TTTCATTAAA AATGGCTGCA 3102 

10 GCTGTAAGAA CTCTTGTCTG ATAAATTTTC AACTATGCTC TCATTTATCT ACCTGCCCTC 316 2 

TGATGCTCAG TCGTCAGACT CTAATGCAAA GGTGGACGTC GGCTGCCTTT GCGTGGGCGG 322 2 

GCTTAGTGGT GAGGAACTGA TATCAGAAAA AAATGCCTTC AAG TAT ACTA ATTTATTAAT 3 282 

15 

AAATATTAGG TGTTTGTTAA AAAAAAAAAA AAAAAAAAAA AAAGCGGCC 3 331 
(2) INFORMATION FOR SEQ ID NO : 2 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (iij MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 : 

Met Ser Leu Arg Val Cys Ser Ser Cys Pro Ala Gly Thr Phe Thr Arg 
30 1 5 io 15 

His Glu Asn Gly lie Glu Arg Cys His Asp Cys Ser Gin Pro Cys Pro 
20 25 30 

35 Trp Pro Met lie Glu Arg Leu Pro Cys Ala Ala Leu Thr Asp Arg Glu 
35 40 45 
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Cys He Cys Pro Pro Gly Met Tyr Gin Ser Asn Gly Thr Cys Ala Pro 
50 55 60 

His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly Thr Glu 
5 65 70 75 80 

Asn Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe Ser Asp 
85 90 95 

10 Val Pro Ser Ser Val Met Lys Cys Lys Ala His Thr Asp Cys Leu Gly 
100 105 110 

Gin Asn Leu Glu Val Val Lys Pro Gly Thr Lys Glu Thr Asp Asn Val 
115 120 125 

15 

Cys Gly Met Arg Leu Phe Phe Ser Ser Thr Asn Pro Pro Ser Ser Gly 
130 135 140 

Thr Val Thr Phe Ser His Pro Glu His Met Glu Ser His Asp Val Pro 
20 145 150 155 160 

Ser Ser Thr Tyr Glu Pro Gin Gly Met Asn Ser Thr Asp Ser Asn Ser 
165 170 175 

25 Thr Ala Ser Val Arg Thr Lys Val Pro Ser Gly He Glu Glu Gly Thr 
180 185 190 

Val Pro Asp Asn Thr Ser Ser Thr Ser Gly Lys Glu Gly Thr Asn Arg 
195 200 205 

30 

Thr Leu Pro Asn Pro Pro Gin Val Thr His Gin Gin Ala Pro His His 
210 215 220 

Arg His He Leu Lys Leu Leu Pro Ser Ser Met Glu Ala Thr Gly Glu 
35 225 230 235 240 
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Lys Ser Ser Thr Ala lie Lys Ala Pro Lys Arg Gly His Pro Arg Gin 
245 250 255 

Asn Ala His Lys His Phe Asp lie Asn Glu His Leu Pro Trp Met lie 
5 260 265 270 

Val Leu Phe Leu Leu Leu Val Leu Val Leu lie Val Val Cys Ser He 
275 230 285 

10 Arg Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp Pro Ser 
290 295 300 

Ala He Val Glu Lys Ala Gly Leu Lys Lys Ser Leu Thr Pro Thr Gin 
305 310 315 320 

15 

Asn Arg Glu Lys Trp He Tyr Tyr Arg Asn Gly His Gly He Asp He 
325 330 335 

Leu Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp Lys Asp He Tyr 
20 340 345 350 

Gin Phe Leu Cys Asn Ala Ser Glu Arg Glu Val Ala Ala Phe Ser Asn 
355 360 365 

25 Gly Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala Leu Gin His Trp 
370 375 380 

Thr He Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu He Ser Ala Leu 
385 390 395 400 

30 

Arg Gin His Arg Arg Asn Asp Val Val Glu Lys He Arg Gly Leu Met 
405 410 415 

Glu Asp Thr Thr Gin Leu Glu Thr Asp Lys Leu Ala Leu Pro Met Ser 
35 420 425 430 
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Pro Ser Pro Leu Ser Pro Ser Pro lie Pro Ser Pro Asn Val Lys Leu 
435 440 445 

Glu Asn Ser Thr Leu Leu Thr Val Glu Pro Ser Pro Leu Asp Lys Asn 
5 450 455 460 

Lys Cys Phe Phe Val Asp Glu Ser Glu Pro Leu Leu Arg Cys Asp Ser 
465 470 475 480 

10 Thr Ser Ser Gly Ser Ser Ala Leu Ser Arg Asn Gly Ser Phe lie Thr 

485 490 495 

Lys Glu Lys Lys Asp Thr Val Leu Arg Gin Val Arg Leu Asp Pro Cys 
500 505 510 

15 

Asp Leu Gin Pro lie Phe Asp Asp Met Leu His lie Leu Asn Pro Glu 
515 520 525 

Glu Leu Arg Val lie Glu Glu lie Pro Gin Ala Glu Asp Lys Leu Asp 
20 530 535 540 

Arg Leu Phe Glu lie lie Gly Val Lys Ser Gin Glu Ala Ser Gin Thr 
545 550 555 560 



25 Leu Leu Asp Ser Val Tyr Ser His Leu Pro Asp Leu Leu 

565 570 



(2) INFORMATION FOR SEQ ID NO : 3 : 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2612 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

35 

( i i ) MOLECULE TYPE : CDNA 
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(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 190.. 951 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GCTCAGCGCC CCTAGACCCT CCCTTGCCGC CTCCCTCCTC TGCCCGGCCG TACCAGTGCA 6 0 

CATGGGGTGT TGGAGGTAGA TGGGCTCCCG GCCCGGGAGG CGGCGGTGGA TGCGGCGCTG 12 0 

10 

GGCAGAAGCA GCCGCCGATT CCAGCTGCCC CGCGCGCCCC GGGCGCCCCT GCGAGTCCCC 18 0 

GGTTCAGCC ATG GGG ACC TCT CCG AGC AGC AGC ACC GCC CTC GCC TCC 228 
Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser 
15 l 5 10 

TGC AGC CGC ATC GCC CGC CGA GCC ACA GCC ACG ATG ATC GCG GGC TCC 2 76 

Cys Ser Arg He Ala Arg Arg Ala Thr Ala Thr Met He Ala Gly Ser 
15 20 25 

20 

CTT CTC CTG CTT GGA TTC CTT AGC ACC ACC ACA GCT CAG CCA GAA CAG 324 

Leu Leu Leu Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin 
30 35 40 4 5 

25 AAG GCC TCG AAT CTC ATT GGC ACA TAG CGC CAT GTT GAC CGT GCC ACC 3 72 

Lys Ala Ser Asn Leu He Gly Thr Tyr Arg His Val Asp Arg Ala Thr 
50 55 60 

GGC CAG GTG CTA ACC TGT GAC AAG TGT CCA GCA GGA ACC TAT GTC TCT 42 0 

30 Gly Gin Val Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser 

65 70 75 

GAG CAT TGT ACC AAC ACA AGC CTG CGC GTC TGC AGC AGT TGC CCT GTG 468 
■ Glu His Cys Thr Asn Thr Ser Leu Arg Val Cys Ser Ser Cys Pro Val 
35 80 85 90 
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GGG ACC TTT ACC AGG CAT GAG AAT GGC ATA GAG AAA TGC CAT GAC TGT 516 

Gly Thr Phe Thr Arg His Glu Asn Gly lie Glu Lys Cys His Asp Cys 

95 100 105 

5 AGT CAG CCA TGC CCA TGG CCA ATG ATT GAG AAA TTA CCT TGT GCT GCC 564 

Ser Gin Pro Cys Pro Trp Pro Met lie Glu Lys Leu Pro Cys Ala Ala 
110 115 120 125 

TTG ACT GAC CGA GAA TGC ACT TGC CCA CCT GGC ATG TTC CAG TCT AAC 612 

10 Leu Thr Asp Arg Glu Cys Thr Cys Pro Pro Gly Met Phe Gin Ser Asn 

130 135 140 

GCT ACC TGT GCC CCC CAT ACG GTG TGT CCT GTG GGT TGG GGT GTG CGG 660 

Ala Thr Cys Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg 
15 145 150 155 

AAG AAA GGG ACA GAG ACT GAG GAT GTG CGG TGT AAG CAG TGT GCT CGG 708 

Lys Lys Gly Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg 
160 165 170 

20 

GGT ACC TTC TCA GAT GTG CCT TCT AGT GTG ATG AAA TGC AAA GCA TAC 7 56 

Gly Thr Phe Ser Asp Val Pro Ser Ser Val Met Lys Cys Lys Ala Tyr 

175 180 185 

25 ACA GAC TGT CTG AGT CAG AAC CTG GTG GTG ATC AAG CCG GGG ACC AAG 8 04 

Thr Asp Cys Leu Ser Gin Asn Leu Val Val lie Lys Pro Gly Thr Lys 
190 195 200 205 

GAG ACA GAC AAC GTC TGT GGC ACA CTC CCG TCC TTC TCC AGC TCC ACC 852 

30 Glu Thr Asp Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr 

210 215 220 

TCA CCT TCC CCT GGC ACA GCC ATC TTT CCA CGC CCT GAG CAC ATG GAA 90 0 

Ser Pro Ser Pro Gly Thr Ala lie Phe Pro Arg Pro Glu His Met Glu 
35 225 230 235 
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ACC CAT GAA GTC CCT TCC TCC ACT TAT GTT CCC AAA GAC CAA AGG TAC 94 8 
Thr His Glu Val Pro Ser Ser Thr Tyr Val Pro Lys Asp Gin Arg Tyr 
240 245 250 

5 TGAGTAGCAT CCAGGAAGGG ACAGTCCCTG ACAACACAAG CTCAGCAAGG GGGAAGGAAG 1008 

ACGTGAACAA GACCCTCCCA AACCTTCAGG TAGTCAACCA CCAGCAAGGC CCCCACCACA 106 8 

GACACATCCT GAAGCTGCTG CCGTCCATGG AGGCCACTGG GGGCGAGAAG TCCAGCACGC 112 8 

CCATCAAGGG CCCCAAGAGG GGACATCCTA GACAGAACCT ACACAAGCAT TTTGACATCA 118 8 

ATGAGCATTT GCCCTGGATG ATTGTGCTTT TCCTGCTGCT GGTGCTTGTG GTGATTGTGG 12 4 8 

15 TGTGCAGTAT CCGGAAAAGC TCGAGGACTC TGAAAAAGGG GCCCCGGCAG GATCCCAGTG 13 0 8 

CCATTGTGGA AAAGGCAGGG CTGAAGAAAT CCATGACTCC AACCCAGAAC CGGGAGAAAT 13 6 8 

GGATCTACTA CTGCAATGGC CATGGTATCG ATATCCTGAA GCTTGTAGCA GCCCAAGTGG 142 8 

20 

GAAGCCAGTG GAAAGATATC TATCAGTTTC TTTGCAATGC CAGTGAGAGG GAGGTTGCTG 14 8 8 

CTTTCTCCAA TGGGTACACA GCCGACCACG AGCGGGCCTA CGCAGCTCTG CAGCACTGGA 154 8 

25 CCATCCGGGG CCCCGAGGCC AGCCTCGCCC AGCTAATTAG CGCCCTGCGC CAGCACCGGA 16 08 

GAAACGATGT TGTGGAGAAG ATTCGTGGGC TGATGGAAGA CACCACCCAG GTAATGGAGC 166 8 

CCTTGTTGTG TGTCATTACC ACCGACCTAT TGCCCCTATG CTTCAAATTT TATCAGTTGT 1728 

30 

ATGGGAACAA AGAAAAATAA CATATTCGGT GGATAGGCAC ACACACACAC ACACGCATAC 1788 

GCCTGCACAC ACACACACAC ACCCTACCTT CTAGGACGGG GGTTCTCAGT GGCCGTCTAT 18 4 8 

35 T AG AAT CAT C TAGAAAACTT TAAAAAAAAA TACTGATGCT CAGACCCTAC CTGCAGACCA 1908 

GTCACATCAG AATCTCCAGG GGGCAGAGCG TGAATCG.GTA TTTGTAAAAG CTCTTTGTTA 196 8 
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CTCCATTTAC AATCCATTTT GCATGACACA CTTTGAACAA AACCAAGAAA AAATACTTTT 2 02 8 

TACTACACCG CCTCTCCTCC AGAGGGTGTT TTTGTGATGT GGCTTATGAA GGCAGCATTC 2 03 8 

5 

TTGCCTCCTG AGGATGCAGG TGGTGCTAGC GGCAGTTGAT GACAGAACTG ATTCTCCTCC 214 8 

TTGGGTTGTT CCGTGGAGCA CATCAGATGG GAACTGAGGG GACCCAGGAG TGTGATTTCT 2 208 

10 TTATAGCTAA TAAGCCCTGG CTTTGGAGCC AGACAGCGCT GGATTTGAAT CCTGGCTCTG 2 2 68 

GTACATATTA GCTTAGGTGA TGAAGGGTAA GTTACTTCAA CTTTCCTTGC CTCTGTTATT 2 32 8 

CACATTTTCA AGTCTGCTAT ATAAGATTAA GATGAGAAAT AAAGCATATA AAATGCCTGA 2 3 88 

15 

CTCATTGAAA GTGTTCTACA AGTGGTAGTT ACGACCATGA TGTAACTCAT TTTACTTAGC 2 44 8 

CTTTCTTTAA TTGTATGTAC TTCCCTGAAA GGCCATGAAT AAAGTTCAGA TTTGGATATT 2 5 08 

20 GAATCATATT TTCCACAGAC TTCAATTCAG GTTTCAGAAC ATATTCCCAA AGTAAAGAAA 2 568 

ATGCTGCCAC TAAGACTAGA TAAAACCCAC TTCAGATTGG TAAC 2 612 
(2) INFORMATION FOR SEQ ID NO : 4 : 



25 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 253 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 



JO Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
15 10 15 
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He Ala Arg 

Leu Gly Phe 
5 35 

Asn Leu He 
50 

10 Leu Thr Cys 
65 

Thr Asn Thr 

15 

Thr Arg His 
Cys Pro Trp 

20 us 

Arg Glu Cys 
130 

25 Ala Pro His 
145 

Thr Glu Thr 

30 

Ser Asp Val 

Leu Ser Gin 
35 195 



Arg Ala Thr Ala 
20 

Leu Ser Thr Thr 



Gly Thr Tyr Arg 
55 

Asp Lys Cys Pro 
70 

Ser Leu Arg Val 
85 

Glu Asn Gly lie 
100 

Pro Met He Glu 

Thr Cys Pro Pro 
135 

Thr Val Cys Pro 
150 

Glu Asp Val Arg 
165 

Pro Ser Ser Val 
180 

Asn Leu Val Val 



- 16- 

Thr Met lie Ala Gly 
25 

Thr Ala Gin Pro Glu 
40 

His Val Asp Arg Ala 
60 

Ala Gly Thr Tyr Val 
75 

Cys Ser Ser Cys Pro 
90 

Glu Lys Cys His Asp 
105 

Lys Leu Pro Cys Ala 
120 

Gly Met Phe Gin Ser 
140. 

Val Gly Trp Gly Val 
155 

Cys Lys Gin Cys Ala 
170 

Met Lys Cys Lys Ala 
185 

He Lys Pro Gly Thr 
200 



Ser Leu Leu Leu 
30 

Gin Lys Ala Ser 
45 

Thr Gly Gin Val 

Ser Glu His Cys 
80 

Val Gly Thr Phe 
95 

Cys Ser Gin Pro 
110 

Ala Leu Thr Asp 
125 

Asn Ala Thr Cys 

Arg Lys Lys Gly 
160 

Arg Gly Thr Phe 
175 

Tyr Thr Asp Cys 
190 

Lys Glu Thr Asp 
205 
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Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser 
210 215 220 

Pro Gly Thr Ala He Phe Pro Arg Pro Glu His Met Glu Thr His Glu 
5 225 . 230 235 240 

Val Pro Ser Ser Thr Tyr Val Pro Lys Asp Gin Arg Tyr 
245 250 

10 (2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1720 base pairs 
(3) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : cDNA 

20 fix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1720 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

25 

ATG AGC CTG CGA GTC TGC AGC AGC TGC CCC GCG GGG ACC TTT ACC AGG 4 8 

Met Ser Leu Arg Val Cys Ser Ser Cys Pro Ala Gly Thr Phe Thr Arg 
15 10 15 

30 CAC GAG AAC GGC ATA GAG AGA TGC CAT GAC TGT AGT CAG CCA TGT CCA 96 
His Glu Asn Gly lie Glu Arg Cys His Asp Cys Ser Gin Pro Cys Pro 
20 25 30 

TGG CCG ATG ATT GAG AGA TTA CCT TGT GCT GCC TTG ACT GAC CGA GAG 14 4 

35 Trp Pro Met lie Glu Arg Leu Pro Cys Ala Ala Leu Thr Asp Arg Glu 
35 40 45 
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TGC ATC TGC CCA CCT GGA ATG TAT CAG TCT AAT GGT ACC TGC GCT CCC 192 
Cys lie Cys Pro Pro Gly Met Tyr Gin Ser Asn Gly Thr Cys Ala Pro 
50 55 60 



3 CAT ACA GTG TGC CCC GTG GGC TGG GGT GTG CGG AAG AAA GGG ACA GAG 2 40 

His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly Thr Glu 
65 70 75 80 

AAT GAA GAT GTG CGC TGT AAG CAG TGC GCT CGG GGT ACC TTC TCT GAC 288 
10 Asn Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe Ser Asp 

85 90 95 

GTG CCT TCC AGT GTG ATG AAG TGT AAA GCT CAC ACG GAC TGT CTG GGT 3 36 

Val Pro Ser Ser Val Met Lys Cys Lys Ala His Thr Asp Cys Leu Gly 
15 100 105 110 

CAG AAC CTG GAG GTG GTC AAG CCA GGG ACC AAG GAG ACA GAC AAC GTC 3 84 

Gin Asn Leu Glu Val Val Lys Pro Gly Thr Lys Glu Thr Asp Asn Val 

115 120 125 

20 

TGT GGC ATG CGC CTG TTC TTC TCC AGC ACA AAC CCA CCT TCC TCT GGC 432 

Cys Gly Met Arg Leu Phe Phe Ser Ser Thr Asn Pro Pro Ser Ser Gly 

130 135 140 

25 ACA GTT ACC TTT TCT CAC CCT GAG CAT ATG GAA TCC CAC GAT GTC CCT 4 80 

Thr Val Thr Phe Ser His Pro Glu His Met Glu Ser His Asp Val Pro 
145 150 155 160 

TCC TCC ACC TAT GAG CCC CAA GGC ATG AAC TCA ACA GAT TCC AAC TCT 52 8 

30 Ser Ser Thr Tyr Glu Pro Gin Gly Met Asn Ser Thr Asp Ser Asn Ser 

165 170 175 

ACT GCC TCT GTT AGA ACT AAG GTA CCA AGT GGC ATC GAG GAA GGG ACA 5 76 

Thr Ala Ser Val Arg Thr Lys Val Pro Ser Gly lie Glu Glu Gly Thr 
35 180 185 190 



BNSDOCID:<WO 9915663A1> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/15663 



PCT/US98/20219 



- 19 - 

GTG CCT GAC AAT ACG AGC TCA ACC AGT GGG AAG GAA GGC ACT AAT AGG 6 24 

Val Pro Asp Asn Thr Ser Ser Thr Ser Gly Lys Glu Gly Thr Asn Arg 
195 200 205 

5 ACC CTG CCA AAC CCA CCA CAA GTT ACC CAC CAG CAA GCC CCC CAC CAC 6 72 

Thr Leu Pro Asn Pro Pro Gin Val Thr His Gin Gin Ala Pro His His 
210 215 220 

AGA CAC ATT CTG AAG CTG CTG CCA TCG TCC ATG GAG GCC ACG GGT GAG 72 0 

10 Arg His lie Leu Lys Leu Leu Pro Ser Ser Met Glu Ala Thr Gly Glu 
225 230 235 240 

AAG TCC AGC ACA GCC ATC AAG GCC CCC AAG AGG GGT CAC CCC AGA CAG 76 8 

Lys Ser Ser Thr Ala He Lys Ala Pro Lys Arg Gly His Pro Arg Gin 
15 245 250 255 

AAC GCT CAC AAG CAT TTC GAC ATC AAC GAG CAC TTG CCT TGG ATG ATC 816 

Asn Ala His Lys His Phe Asp He Asn Glu His Leu Pro Trp Met lie 

260 265 270 

20 

GTC CTC TTC CTT CTG CTG GTC CTG GTG CTG ATA GTG GTG TGC AGT ATC 864 

Val Leu Phe Leu Leu Leu Val Leu Val Leu lie Val Val Cys Ser He 

275 280 285 

25 CGA AAG AGC TCC AGG ACT CTC AAA AAG GGG CCC CGG CAG GAT CCC AGC 912 
Arg Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp Pro Ser 
290 295 300 

GCC ATA GTG GAA AAG GCG GGG CTG AAG AAG TCC CTG ACT CCC ACC CAG 96 0 

30 Ala He Val Glu Lys Ala Gly Leu Lys Lys Ser Leu Thr Pro Thr Gin 
305 310 315 320 

AAC CGG GAG AAA TGG ATC TAC TAC CGC AAC GGC CAT GGT ATT GAC ATC 1008 
Asn Arg Glu Lys Trp He Tyr Tyr Arg Asn Gly His Gly He Asp He 
35 325 330 335 
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TTG AAG CTT GTA GCA GCC CAG GTG GGA AGC CAG TGG AAG GAC ATC TAT 1056 
Leu Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp Lys Asp lie Tyr 
340 345 350 

5 CAG TTT CTT TGC AAC GCC AGT GAG AGG GAG GTG GCG GCC TTC TCC AAT 1104 
Gin Phe Leu Cys Asn Ala Ser Glu Arg Glu Val Ala Ala Phe Ser Asn 
355 360 365 

GGA TAC ACT GCA GAT CAT GAA CGG GCC TAC GCG GCT CTG CAG CAC TGG 1152 
10 Gly Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala Leu Gin His Trp 
370 375 380 

ACC ATC CGT GGC CCT GAG GCC AGC CTT GCC CAG CTC ATT AGC GCC TTG 12 0 0 

Thr lie Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu lie Ser Ala Leu 
15 385 390 395 400 

CGC CAG CAC CGA CGC AAT GAT GTT GTG GAG AAG ATT CGT GGG CTG ATG 124 8 

Arg Gin His Arg Arg Asn Asp Val Val Glu Lys lie Arg Gly Leu Met 
405 410 415 

20 

GAA GAC ACC ACG CAG TTG GAA ACA GAC AAA CTG GCT CTC CCC ATG AGC 12 96 

Glu Asp Thr Thr Gin Leu Glu Thr Asp Lys Leu Ala Leu Pro Met Ser 
420 425 430 

25 CCC AGT CCG CTG AGC CCG AGC CCC ATC CCC AGT CCT AAC GTG AAA CTT 13 4 4 

Pro Ser Pro Leu Ser Pro Ser Pro lie Pro Ser Pro Asn Val Lys Leu 
435 440 445 

GAG AAT TCC ACT CTC CTG ACA GTG GAG CCC TCA CCG CTG GAC AAG AAC 13 92 

30 Glu Asn Ser Thr Leu Leu Thr Val Glu Pro Ser Pro Leu Asp Lys Asn 
450 455 460 

AAG TGC TTC TTC GTG GAC GAG TCA GAG CCC CTT CTG CGT TGC GAC TCC 14 4 0 

Lys Cys Phe Phe Val Asp Glu Ser Glu Pro Leu Leu Arg Cys Asp Ser 
35 465 470 475 480 
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ACA TCC AGT GGC TCT TCA GCA CTG AGC AGA AAC GGC TCC TTT ATT ACC 14 8 8 

Thr Ser Ser Gly Ser Ser Ala Leu Ser Arg Asn Gly Ser Phe lie Thr 
485 490 495 

5 AAA GAA AAG AAG GAC ACA GTG TTG CGG CAG GTC CGC CTG GAC CCC TGT 153 6 

Lys Glu Lys Lys Asp Thr Val Leu Arg Gin Val Arg Leu Asp Pro Cys 
500 505 510 

GAC TTG CAG CCC ATC TTT GAT GAC ATG CTG CAT ATC CTG AAC CCC GAG 15 84 

10 Asp Leu Gin Pro lie Phe Asp Asp Met Leu His lie Leu Asn Pro Glu 
515 520 525 

GAG CTG CGG GTG ATT GAA GAG ATT CCC CAG GCT GAG GAC AAA CTG GAC 16 3 2 

Glu Leu Arg Val lie Glu Glu lie Pro Gin Ala Glu Asp Lys Leu Asp 
15 530 535 540 

CGC CTC TTC GAG ATC ATT GGG GTC AAG AGC CAA GAA GCC AGC CAG ACC 16 8 0 

Arg Leu Phe Glu lie lie Gly Val Lys Ser Gin Glu Ala Ser Gin Thr 
545 550 555 560 

20 

CTC TTG GAC TCT GTG TAC AGT CAT CTT CCT GAC CTA TTG 172 0 

Leu Leu Asp Ser Val Tyr Ser His Leu Pro Asp Leu Leu 
565 570 

25 (2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 59 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 

35 (ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .75 9 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

ATG GGG ACC TCT CCG AGC AGC AGC ACC GCC CTC GCC TCC TGC AGC CGC 4 8 

5 Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
15 10 15 

ATC GCC CGC CGA GCC ACA GCC ACG ATG ATC GCG GGC TCC CTT CTC CTG 96 
lie Ala Arg Arg Ala Thr Ala Thr Met lie Ala Gly Ser Leu Leu Leu 
10 20 25 30 

CTT GGA TTC CTT AGC ACC ACC ACA GCT CAG CCA GAA CAG AAG GCC TCG 144 

Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser 
35 40 45 

15 

AAT CTC ATT GGC ACA TAC CGC CAT GTT GAC CGT GCC ACC GGC CAG GTG 192 

Asn Leu He Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val 
50 55 60 

20 CTA ACC TGT GAC AAG TGT CCA GCA GGA ACC TAT GTC TCT GAG CAT TGT 24 0 

Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys 
65 70 75 80 

ACC AAC ACA AGC CTG CGC GTC TGC AGC AGT TGC CCT GTG GGG ACC TTT 28 3 

25 Thr Asn Thr Ser Leu Arg Val Cys Ser Ser Cys Pro Val Gly Thr Phe 

85 90 95 

ACC AGG CAT GAG AAT GGC ATA GAG AAA TGC CAT GAC TGT AGT CAG CCA 3 36 

Thr Arg His Glu Asn Gly He Glu Lys Cys His Asp Cys Ser Gin Pro 
30 100 105 110 

TGC CCA TGG CCA ATG ATT GAG AAA TTA CCT TGT GCT GCC TTG ACT GAC 3 84 

Cys Pro Trp Pro Met He Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp 

115 120 12.5 

35 
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CGA GAA TGC ACT TGC CCA CCT GGC ATG TTC CAG TCT AAC GCT ACC TGT 4 32 

Arg Glu Cys Thr Cys Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys 
130 135 140 

5 GCC CCC CAT ACG GTG TGT CCT GTG GGT TGG GGT GTG CGG AAG AAA GGG 4 80 

Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly 
145 150 155 160 

ACA GAG ACT GAG GAT GTG CGG TGT AAG CAG TGT GCT CGG GGT ACC TTC 52 8 

10 Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe 

165 170 175 

TCA GAT GTG CCT TCT AGT GTG ATG AAA TGC AAA GCA TAC ACA GAC TGT 5 76 

Ser Asp Val Pro Ser Ser Val Met Lys Cys Lys Ala Tyr Thr Asp Cys 
15 180 185 190 

CTG AGT CAG AAC CTG GTG GTG ATC AAG CCG GGG ACC AAG GAG ACA GAC 624 

Leu Ser Gin Asn Leu Val Val lie Lys Pro Gly Thr Lys Glu Thr Asp 

195 200 205 

20 

AAC GTC TGT GGC ACA CTC CCG TCC TTC TCC AGC TCC ACC TCA CCT TCC 6 72 

Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser 
210 215 220 

25 CCT GGC ACA GCC ATC TTT CCA CGC CCT GAG CAC ATG GAA ACC CAT GAA 72 0 

Pro Gly Thr Ala lie Phe Pro Arg Pro Glu His Met Glu Thr His Glu 
225 230 235 240 

GTC CCT TCC TCC ACT TAT GTT CCC AAA GAC CAA AGG TAC 75 9 

30 Val Pro Ser Ser Thr Tyr Val Pro Lys Asp Gin Arg Tyr 

245 250 

(2) INFORMATION FOR SEQ ID NO : 7 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

10 (xi) SEQUENCE DESCRIPTION : SEQ ID NO : 7 : 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
15 10 15 

15 Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 

" 20 25 30 

Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Asp Gin 
35 40 45 

20 

Thr Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Gly Gin His Ala Lys 
50 55 60 

Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 
25 65 70 75 80 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cys Leu Ser Cys 
85 90 95 

30 Gly Ser Arg Cys Ser Ser Asp Gin Val Glu Thr Gin Ala Cys Thr Arg 

100 105 110 

Glu Gin Asn Arg lie Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 
115 120 125 



3D 



Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 135 140 
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Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 

145 150 155 160 

5 Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr 

165 170 175 



Asp He Cys Arg Pro His Gin lie Cys Asn Val Val Ala He Pro Gly 
180 135 190 

10 

Asn Ala Ser Arg Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser 
195 200 205 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Ser 
15 210 215 220 



Gin His Thr Gin Pro Thr Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser 

225 230 235 240 

20 Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly 

245 250 255 



Asp Phe Ala Leu Pro Val Gly Leu He Val Gly Val Thr Ala Leu Gly 
260 265 270 

25 

Leu Leu He He Gly Val Val Asn Cys Val He Met Thr Gin Val Lys 
275 230 285 



Lys Lys Pro Leu Cys Leu Gin Arg Glu Ala Lys Val Pro His Leu Pro 

30 290 295 300 

Ala Asp Lys Ala Arg Gly Thr Gin Gly Pro Glu Gin Gin His Leu Leu 

305 310 315 320 



35 He Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala Ser 

325 330 335 
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Ala Leu Asp Arg Arg Ala Pro Thr Arg Asn Gin Pro Gin Ala Pro Gly 
340 345 350 

Val Glu Ala Ser Gly Ala Gly Glu Ala Arg Ala Ser Thr Gly Ser Ser 
5 355 360 365 

Asp Ser Ser Pro Gly Gly His Gly Thr Gin Val Asn Val Thr Cys lie 
370 375 380 

10 Val Asn Val Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser Ser Gin 

385 390 395 400 

Ala Ser Ser Thr Met Gly Asp Thr Asp Ser Ser Pro Ser Glu Ser Pro 
405 410 415 



15 



20 



Lys Asp Glu Gin Val Pro Phe Ser Lys Glu Glu Cys Ala Phe Arg Ser 
420 425 430 

Gin Leu Glu Thr Pro Glu Thr Leu Leu Gly Ser Thr Glu Glu Lys Pro 
435 440 445 

Leu Pro Leu Gly Val Pro Asp Ala Gly Met Lys Pro Ser 
450 455 460 



2D (2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



5 



(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
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Met Ala Pro Ala Ala Leu Trp Val Ala Leu Val Phe Glu Leu Gin Leu 
15 10 15 

Trp Ala Thr Gly His Thr Val Pro Ala Gin Val Val Leu Thr Pro Tyr 
20 25 30 

Lys Pro Glu Pro Gly Tyr Glu Cys Gin lie Ser Gin Glu Tyr Tyr Asp 
35 40 45 

Arg Lys Ala Gin Met Cys Cys Ala Lys Cys Pro Pro Gly Gin Tyr Val 

50 55 60 

Lys His Phe Cys Asn Lys Thr Ser Asp Thr Val Cys Ala Asp Cys Glu 
65 70 75 80 

Ala Ser Met Tyr Thr Gin Val Trp Asn Gin Phe Arg Thr Cys Leu Ser 
85 90 95 



Cys Ser Ser Ser Cys Thr Thr Asp Gin Val Glu lie Arg Ala Cys Thr 
20 100 105 110 

Lys Gin Gin Asn Arg Val Cys Ala Cys Glu Ala Gly Arg Tyr Cys Ala 
115 120 125 

25 Leu Lys Thr His Ser Gly Ser Cys Arg Gin Cys Met Arg Leu Ser Lys 

130 135 140 

Cys Gly Pro Gly Phe Gly Val Ala Ser Ser Arg Ala Pro Asn Gly Asn 
145 150 155 160 

30 

Val Leu Cys Lys Ala Cys Ala Pro Gly Thr Phe Ser Asp Thr Thr Ser 
165 170 175 

Ser Thr Asp Val Cys Arg Pro His Arg lie Cys Ser lie Leu Ala lie 
35 180 185 190 
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Pro Gly Asn Ala Ser Thr Asp Ala Vai Cys Ala Pro Glu Ser Pro Thr 
195 200 '205 

Leu Ser Ala lie Pro Arg Thr Leu Tyr Val Ser Gin Pro Glu Pro Thr 
210 215 220 

Arg Ser Gin Pro Leu Asp Gin Glu Pro Gly Pro Ser Gin Thr Pro Ser 
225 230 235 240 

lie Leu Thr Ser Leu Gly Ser Thr Pro lie lie Glu Gin Ser Thr Lys 
245 250 255 

Gly Gly lie Ser Leu Pro lie Gly Leu He Val Gly Val Thr Ser Leu 
260 265 270 

Gly Leu Leu Met Leu Gly Leu Val Asn Cys He lie Leu Val Gin Arg 
275 280 285 

Lys Lys Lys Pro Ser Cys Leu Gin Arg Asp Ala Lys Val Pro His Val 
290 295 300 

Pro Asp Glu Lys Ser Gin Asp Ala Val Gly Leu Glu Gin Gin Kis Leu 
305 310 315 320 

Leu Thr Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala 
325 330 335 

Ser Ala Gly Asp Arg Arg Ala Pro Pro Gly Gly His Pro Gin Ala Arg 
340 345 350 

Val Met Ala Glu Ala Gin Gly Phe Gin Glu Ala Arg Ala Ser Ser Arg 
355 360 365 

He Ser Asp Ser Ser His Gly Ser His Gly Thr His Val Asn Val Thr 
370 375 380 
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Cys lie Val Asn Val Cys Ser Ssr Ser Asp His Ser Ser Gin Cys Ser 
385 390 395 400 

Ser Gin Ala Ser Ala Thr Val Gly Asp Pro Asp Ala Lys Pro Ser Ala 
5 405 410 415 

Ser Pro Lys Asp Glu Gin Val Pro Phe Ser Gin Glu Glu Cys Pro Ser 
420 425 " 430 

10 Gin Ser Pro Cys Glu Thr Thr Glu Thr Leu Gin Ser His Glu Lys Pro 

435 440 445 

Leu Pro Leu Gly Val Pro Asp Met Gly Met Lys Pro Ser Gin Ala Gly 
450 455 460 

15 

Trp Phe Asp Gin lie Ala Val Lys Val Ala 
465 470 

(2) INFORMATION FOR SEQ ID NO : 9 : 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Met Leu Arg Leu lie Ala Leu Leu Val Cys Val Val Tyr Val Tyr Gly 
15 10 15 

35 Asp Asp Val Pro Tyr Ser Ser Asn Gin Gly Lys Cys Gly Gly His Asp 

20 25 30 
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Tyr Glu Lys Asp Gly Leu Cys Cys Ala Ser Cys His Pro Gly Phe Tyr 
35 40 45 

Ala Ser Arg Leu Cys Gly Pro Gly Ser Asn Thr Val Cys Ser Pro Cys 
50 55 60 

Glu Asp Gly Thr Phe Thr Ala Ser Thr Asn His Ala Pro Ala Cys Val 
65 70 75 80 

Ser Cys Arg Gly Pro Cys Thr Gly His Leu Ser Glu Ser Gin Pro Cys 
85 90 95 

Asp Arg Thr His Asp Arg Val Cys Asn Cys Ser Thr Gly Asn Tyr Cys 
100 105 110 

Leu Leu Lys Gly Gin Asn Gly Cys Arg lie Cys Ala Pro Gin Thr Lys 
115 120 125 

Cys Pro Ala Gly Tyr Gly Val Ser Gly His Thr Arg Ala Gly Asp Thr 
130 135 140 

Leu Cys Glu Lys Cys Pro Pro His Thr Tyr Ser Asp Ser Leu Ser Pro 
145 150 155 160 

Thr Glu Arg Cys Gly Thr Ser Phe Asn Tyr He Ser Val Gly Phe Asn 
165 170 175 

Leu Tyr Pro Val Asn Glu Thr Ser Cys Thr Thr Thr Ala Gly His Asn 
180 185 190 

Glu Val He Lys Thr Lys Glu Phe Thr Val Thr Leu Asn Tyr Thr Asp 
195. 200 205 

Cys Asp Pro Val Phe His Thr Glu Tyr Tyr Ala Thr Ser Gly Lys Glu 
210 215 220 
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Gly Ala Gly Gly Phe Phe Thr Gly Thr Asp lie Tyr Gin Asn Thr Thr 
225 230 235 240 

Lys Val Cys Thr Leu Asn Val Glu lie Gin Cys Ser Glu Gly Asp Asp 
5 245 250 255 

lie His Thr Leu Gin Lys Thr Asn Gly Gly Ser Thr Met Pro His Ser 
260 2S5 270 

10 Glu Thr lie Thr Val Val Gly Ser Cys Leu Ser Asp Val Asn Val Asp 

275 280 285 

lie Met Tyr Ser Asp Thr Asn Kis Pro Gly Glu Val Asp Asp Phe Val 
290 295 300 



15 



20 



Glu Tyr His Trp Gly Thr Arg Leu Arg Phe Phe Pro Leu Pro Lys Arg 
305 310 315 320 

Cys Thr Pro Val Ser 
325 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: peptide 



(v) FRAGMENT TYPE : internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



^3 Met Val Arg Leu Pro Leu Gin Cys Val Leu Trp Gly Cys Leu Leu Thr 

1 5 10 .15 
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Ala Val His Pro Glu Pro Pro Thr Ala Cys Arg Glu Lys Gin Tyr Leu 
20 25 30 

lie Asn Ser Gin Cys Cys Ser Leu Cys Gin Pro Gly Gin Lys Leu Val 
35 40 45 

Ser Asp Cys Thr Glu Phe Thr Glu Thr Glu Cys Leu Pro Cys Gly Glu 
50 55 60 

Ser Glu Phe Leu Asp Thr Trp Asn Arg Glu Thr His Cys His Gin His 
65 70 75 30 

Lys Tyr Cys Asp Pro Asn Leu Gly Leu Arg Val Gin Gin Lys Gly Thr 
85 90 95 

Ser Glu Thr Asp Thr lie Cys Thr Cys Glu Glu Gly Trp His Cys Thr 
100 105 110 

Ser Glu Ala Cys Glu Ser Cys Val Leu His Arg Ser Cys Ser Pro Gly 
115 120 125 

Phe Gly Val Lys Gin lie Ala Thr Gly Val Ser Asp Thr lie Cys Glu 
130 135 140 

Pro Cys Pro Val Gly Phe Phe Ser Asn Val Ser Ser Ala Phe Glu Lys 
145 150 155 160 

Cys His Pro Trp Thr Ser Cys Glu Thr Lys Asp Leu Val Val Gin Gin 
165 170 175 

Ala Gly Thr Asn Lys Thr Asp Val Val Cys Gly Pro Gin Asp Arg Leu 
180 185 190 

Arg Ala Leu Val Val lie Pro lie lie Phe Gly lie Leu Phe Ala lie 
195 200 205 
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Leu Leu Val Leu Val Phe lie Lys Lys Val Ala Lys Lys Pro Thr Asn 
210 215 220 

Lys Ala Pro His Pro Lys Gin Glu Pro Gin Glu lie Asn Phe Pro Asp 
225 230 235 240 

Asp Leu Pro Gly Ser Asn Thr Ala Ala Pro Val Gin Glu Thr Leu His 
245 250 255 

Gly Cys Gin Pro Val Thr Gin Glu Asp Gly Lys Glu Ser Arg lie Ser 
260 265 270 

Val Gin Glu Arg Gin 
275 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 289 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

Met Val Ser Leu Pro Arg Leu Cys Ala Leu Trp Gly Cys Leu Leu Thr 
15 10 15 

Ala Val His Leu Gly Gin Cys Val Thr Cys Ser Asp Lys Gin Tyr Leu 
20 25 30 

His Asp Gly Gin Cys Cys Asp Leu Cys Gin Pro Gly Ser Arg Leu Thr 
35 ~ 40 45 
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Ser His Cys Thr Ala Leu Glu Lys Thr Gin Cys His Pro Cys Asp Ser 
50 55 60 

Gly Glu Phe Ser Ala Gin Trp Asn Arg Glu lie Arg Cys His Gin His 
5 65 70 75 80 

Arg His Cys Glu Pro Asn Gin Gly Leu Arg Val Lys Lys Glu Gly Thr 
85 90 95 

10 Ala Glu Ser Asp Thr Val Cys Thr Cys Lys Glu Gly Gin His Cys Thr 

100 105 110 

Ser Lys Asp Cys Glu Ala Cys Ala Gin His Thr Pro Cys lie Pro Gly 
115 120 125 

15 

Phe Gly Val Met Glu Met Ala Thr Glu Thr Thr Asp Thr Val Cys His 
130 135 140 

Pro Cys Pro Val Gly Phe Phe Ser Asn Gin Ser Ser Leu Phe Glu Lys 
20 145 150 155 160 

Cys Tyr Pro Trp Thr Ser Cys Glu Asp Lys Asn Leu Glu Val Leu Gin 
165 170 175 

25 Lys Gly Thr Ser Gin Thr Asn Val lie Cys Gly Leu Lys Ser Arg Met 

180 185 190 

Arg Ala Leu Leu Val lie Pro Val Val Met Gly lie Leu He Thr He 
195 200 205 

30 

Phe Gly Val Phe Leu Tyr He Lys Lys Val Val Lys Lys Pro Lys Asp 
210 215 220 

Asn Glu Met Leu Pro Pro Ala Ala Arg Arg Gin Asp Pro Gin Glu Met 
35 225 230 235 240 
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Glu Asp Tyr Pro Gly His Asn Thr Ala Ala Pro Val Gin Glu Thr Leu 
245 250 255 

His Gly Cys Gin Pro Val Thr Gin Glu Asp Gly Lys Glu Ser Arg He 
260 265 270 

Ser Val Gin Glu Arg Gin Val Thr Asp Ser He Ala Leu Arg Pro Leu 
275 280 285 

Val 



(2) INFORMATION FOR SEQ ID NO: 12: 

• <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Asn Lys Leu Leu Cys Cys Ala Leu Val Phe Leu Asp He Ser He 
15 10 15 

Lys Trp Thr Thr Gin Glu Thr Phe Pro Pro Lys Tyr Leu His Tyr Asp 
20 25 30 

Glu Glu Thr Ser His Gin Leu Leu Cys Asp L,ys Cys Pro Pro Gly Thr 
35 40 45 

Tyr Leu Lys Gin His Cys Thr Ala Lys Trp Lys Thr Val Cys Ala Pro 
50 55 60 
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Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 

65 70 75 80 

Leu Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Tyr Val Lys Gin Glu 
85 90 95 



Cys Asn Arg Thr His 
100 

Leu Glu lie Glu Phe 
115 

Giy Val Val Gin Ala 
130 

Cys Pro Asp Gly Phe 
145 

Arg Lys His Thr Asn 
165 



Asn Arg Val Cys Glu Cys 
105 

Cys Leu Lys His Arg Ser 
120 

Gly Thr Pro Glu Arg Asn 
135 

Phe Ser Asn Glu Thr Ser 
150 155 

Cys Ser Val Phe Gly Leu 
170 



Lys Glu Gly Arg Tyr 
110 

Cys Pro Pro Gly Phe 
125 

Thr Val Cys Lys Arg 
140 

Ser Lys Ala Pro Cys 
160 

Leu Leu Thr Gin Lys 
175 



Gly Asn Ala Thr 
180 

Gin Lys Cys Gly 
195 

Phe Ala Val Pro 
210 

Asp Asn Leu Pro 
225 

Lys Arg Gin His 



His Asp Asn lie 

lie Asp Val Thr 
200 

Thr Lys Phe Thr 
215 

Gly Thr Lys Val 
230 

Ser Ser Gin Glu 
245 



Cys Ser Gly Asn 
185 

Leu Cys Glu Glu 

Pro Asn Trp Leu 
220 

Asn Ala Glu Ser 
235 

Gin Thr Phe Gin 
250 



Ser Glu Ser Thr 
190 

Ala Phe Phe Arg 
205 

Ser Val Leu Val 

Val Glu Arg lie 
240 

Leu Leu Lys Leu 
255 
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Trp Lys His Gin Asn Lys Ala Gin Asp lie Val Lys Lys lie lie Gin 
260 265 270 

Asp lie Asp Leu Cys Glu Asn Ser Val 'Gin Arg His lie Gly His Ala 
5 275 280 285 

Asn Leu Thr Phs Glu Gin Leu Arg Ser Leu Met Glu Ser Leu Pro Gly 
290 295 300 

10 Lys Lys Val Gly Ala Glu Asp lie Glu Lys Thr lie Lys Ala Cys Lys 

305 310 315 320 

Pro Ser Asp Gin lie Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn 
325 330 335 

15 

Gly Asp Gin Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser 
340 345 350 

Lys Thr Tyr His Phe Pro Lys Thr Val Thr Gin Ser Leu Lys Lys Thr 
20 355 360 365 

lie Arg Phe Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gin Lys Leu 
370 375 380 

25 Phe Leu Glu Met lie Gly Asn Gin Val Gin Ser Val Lys He Ser Cys 

385 390 395 400 



Leu 



30 



(2) INFORMATION FOR SEQ ID NO: 13 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 
35 (B) TYPE: amino acid 

( D ) TO POLOG Y : linear 
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(ii) MOLECULE TYPE : peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Asn Lys Trp Leu Cys Cys Ala Leu Leu Val Leu Leu Asp lie lie 
15 10 15 

Glu Trp Thr Thr Gin Glu Thr Leu Pro Pro Lys Tyr Leu Kis Tyr Asp 
20 25 30 

Pro Glu Thr Gly His Gin Leu Leu Cys Asp Lys Cys Ala Pro Gly Thr 
35 40 45 

Tyr Leu Lys Gin His Cys Thr Val Arg Arg Lys Thr Leu Cys Val Pro 
50 55 60 

Cys Pro Asp His Ser Tyr Thr Asp Ser Trp His Thr Ser Asp Glu Cys 
65 ' 70 75 80 

Val Tyr Cys Ser Pro Val Cys Lys Glu Leu Gin Ser Val Lys Gin Glu 
85 90 95 

Cys Asn Arg Thr His Asn Arg Val Cys Glu Cys Glu Glu Gly Arg Tyr 
100 105 110 

Leu Glu lie Glu Phe Cys Leu Lys His Arg Ser Cys Pro Pro Gly Ser 
115 120 125 



Gly Val Val Gin Ala Gly Thr Pro Glu Arg Asn Thr Val Cys Lys Lys 
130 135 140 K 



Cys Pro Asp Gly Phe Phe Ser Gly Glu Thr Ser Ser Lys Ala Pro Cys 
145 150 155 160 

lie Lys His Thr Asn Cys Ser Thr Phe Gly Leu Leu Leu lie Gin Lys 
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165 170 175 

Gly Asn Ala Thr His Asp Asn Val Cys Ser Gly Asn Arg Glu Ala Thr 
180 185 190 

5 

Gin Lys Cys Gly lie Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg 
195 200 205 

Phe Ala Val Pro Thr Lys lie lie Pro Asn Trp Leu Ser Val Leu Val 
10 210 215 220 

Asp Ser Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg lie 
225 230 235 240 

15 Lys Arg Arg His Ser Ser Gin Glu Gin Thr Phe Gin Leu Leu Lys Leu 

245 250 255 

Trp Lys His Gin Asn Arg Asp Gin Glu Met Val Lys Lys lie lie Gin 
260 265 270 

20 

Asp lie Asp Leu Cys Glu Ser Ser Val Gin Arg His Leu Gly His Ser 
275 280 285 

Asn Leu Thr Thr Glu Gin Leu Leu Ala Leu Met Glu Ser Leu Pro Gly 
25 290 295 300 

Lys Lys lie Ser Pro Glu Glu lie Glu Arg Thr Arg Lys Thr Cys Lys 
305 310 315 320 

30 Ser Ser Glu Gin Leu Leu Lys Leu Leu Ser Leu Trp Arg lie Lys Asn 

325 330 335 

Gly Asp Gin Asp Thr Leu Lys Gly Leu Met Tyr Ala Leu Lys His Leu 
340 345 350 



3^ 



Lys Thr Ser His Phe Pro Lys Thr Val Thr His Ser Leu Arg Lys Thr 
355 360 365 
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Met Arg Phe Leu His Ser'Phe Thr Met: Tyr Arg Leu Tyr Gin Lys Leu 



370 



375 



380 



Phe Leu Glu Met He Gly Asn Gin Val Gin Ser Val Lys II 



e Ser Cys 



385 



390 



395 



400 



Leu 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCCCTGACAA CACAAGCTCA 2 0 

(2) INFORMATION FOR SEQ ID NO: 15: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCCATTTCTC CCGGTTCTG 19 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 18 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 Cii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 16: 
CCATGTTGAC CGTACCAC 13 

15 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNA 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 17 



GCACTCTCGG TCAGTCAA 18 
30 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTTTGAATTC CAGCCAGAAC AGAAGGCCTC GA 3 2 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTTTCTAGA TACCTTTGGT CTTTGGGAAC 3 0 

(2) INFORMATION FOR SEQ ID NO : 2 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AAAAAAGAAT TCGCCGCCAT GGGGACCTCT 3 0 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 3 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pepcide 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 
CTTGTCGTCG TCGTCCTTGT AGTCGTACCT TTG 3 3 

15 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2638 base pairs 

(B) TYPE : nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

25 (ixi FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 510. .2327 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 

30 

CGGGTCACGT TTGCGGGTAC CCACCCACAC CCGCCGCGCT TAATGCGCCG CTACAGGGCG 6 0 



CGTCCCATTC GCCATTCAGG CTGCGCAACT GTTGGAAGGG CGATCGGGTG CGGGCCTCTT 12 0 



J 3 CGCTATTACG CCAAGCTGGC GAAAGGGGGA TGTGCTGCAA GGCGATTAAG TTGGGTAACG 18 0 



CCAAGGGTTT TCCCAGTCAC GACGGTTGTA AAACGACGGC CAGTGAATTG AATTTAGGTG 24 0 
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ACACTATAGA AGAGCTATGA CGTCGCATGC ACGCGTACGT AAGCTTGGAT CCTCTAGAGC 300 

GGCCGCGCCG CTGGGCAGGT GCTGAGCGCC CCTAGAGCCT CCCTTGCCGC CTCCCTCCTC 360 

5 

TGCCCGGCCG CAGCAGTGCA CATGGGGTGT TGGAGGTAGA TGGGCTCCCG GCCCGGGAGG 42 0 

CGGCGGTGGA TGCGGCGCTG GGCAGAAGCA GCCGCCGATT CCAGCTGCCC CGCGCGCCCC 480 

10 GGGCGCCCCT GCGAGTCCCC GGTTCAGCC ATG GGG ACC TCT CCG AGC AGC AGC 53 3 

Met Gly Thr Ser Pro Ser Ser Ser 
1 5 

ACC GCC CTC GCC TCC TGC AGC CGC ATC GCC CGC CGA GCC ACA GCC ACG 531 

15 Thr Ala Leu Ala Ser Cys Ser Arg lie Ala Arg Arg Ala Thr Ala Thr 
10 15 20 



ATG ATC GCG GGC TCC CTT CTC CTG CTT GGA TTC CTT AGC ACC ACC ACA 629 
Met lie Ala Gly Ser Leu Leu Leu Leu Gly Phe Leu Ser Thr Thr Thr 
20 25 30 35 40 



2D 



GCT CAG CCA GAA CAG AAG GCC TCG AAT CTC ATT GGC ACA TAC CGC CAT 6 7" 

Ala Gin Pro Glu Gin Lys Ala Ser Asn Leu lie Gly Thr Tyr Arg His 
45 • 50 55 

GTT GAC CGT GCC ACC GGC CAG GTG CTA ACC TGT GAC AAG TGT CCA GCA 72; 

Val Asp Arg Ala Thr Gly Gin Val Leu Thr Cys Asp Lys Cys Pro Ala 
60 65 70 



30 GGA ACC TAT GTC TCT GAG CAT TGT ACC AAC ACA AGC CTG CGC GTC TGC 77 3 

Gly Thr Tyr Val Ser Glu His Cys Thr Asn Thr Ser Leu Arg Val Cys 
75 80 35 

AGC AGT TGC CCT GTG GGG ACC TTT ACC AGG CAT GAG AAT GGC ATA GAG 821 
35 Ser Ser Cys Pro Val Gly Thr Phe Thr Arg His Glu Asn Gly lie Glu 
90 95 100 
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AAA TGC CAT GAC TGT AGT CAG CCA TGC CCA TGG CCA ATG ATT GAG AAA 869 
Lys Cys His Asp Cys Ser Gin Pro Cys Pro Trp Pro Met lie Glu Lys 
105 110 115 120 

5 TTA CCT TGT GCT GCC TTG ACT GAC CGA GAA TGC ACT TGC CCA CCT GGC 917 
Leu Pro Cys Ala Ala Leu Thr Asp Arg Glu Cys Thr Cys Pro Pro Gly 
125 130 135 

ATG TTC CAG TCT AAC GCT ACC TGT GCC CCC CAT ACG GTG TGT CCT GTG 96 5 

10 Met Phe Gin Ser Asn Ala Thr Cys Ala Pro His Thr Val Cys Pro Val 
140 145 150 

GGT TGG GGT GTG CGG AAG AAA GGG ACA GAG ACT GAG GAT GTG CGG TGT 1013 
Gly Trp Gly Val Arg Lys Lys Gly Thr Glu Thr Glu Asp Val Arg Cys 
15 155 160 165 

AAG CAG TGT GCT CGG GGT ACC TTC TCA GAT GTG CCT TCT AGT GTG ATG 1061 

Lys Gin Cys Ala Arg Gly Thr Phe Ser Asp Val Pro Ser Ser Val Met 

170 175 180 

20 

AAA TGC AAA GCA TAC ACA GAC TGT CTG AGT CAG AAC CTG GTG GTG ATC 110 9 

Lys Cys Lys Ala Tyr Thr Asp Cys Leu Ser Gin Asn Leu Val Val lie 

185 190 195 200 

25 AAG CCG GGG ACC AAG GAG ACA GAC AAC GTC TGT GGC ACA CTC CCG TCC 115 7 

Lys Pro Gly Thr Lys Glu Thr Asp Asn Val Cys Gly Thr Leu Pro Ser 
205 210 215 

TTC TCC AGC TCC ACC TCA CCT TCC CCT GGC ACA GCC ATC TTT CCA CGC 12 0 5 

30 Phe Ser Ser Ser Thr Ser Pro Ser Pro Gly Thr Ala lie Phe Pro Arg 
220 ( 225 230 

CCT GAG CAC ATG GAA ACC CAT GAA GTC CCT TCC TCC ACT TAT GTT CCC 125 3 

Pro Glu His Met Glu Thr His Glu Val Pro Ser Ser Thr Tyr Val Pro 
35 235 240 245 
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AAA GGC ATG AAC TCA ACA GAA TCC AAC TCT TCT GCC TCT GTT AGA CCA 13 01 

Lys Gly Met Asn Ser Thr Glu Ser Asn Ser Ser Ala Ser Val Arg Pro 
250 255 260 



5 AAG GTA CTG AGT AGC ATC CAG GAA GGG ACA GTC CCT GAC AAC ACA AGC 134 9 

Lys Val Leu Ser Ser lie Gin Glu Gly Thr Val Pro Asp Asn Thr Ser 
265 270 275 280 



TCA GCA AGG GGG AAG GAA GAC GTG AAC AAG ACC CTC CCA AAC CTT CAG 13 97 

Ser Ala Arg Gly Lys Glu Asp Val Asn Lys Thr Leu Pro Asn Leu Gin 
285 290 295 



GTA GTC AAC CAC CAG CAA GGC CCC CAC CAC AGA CAC ATC CTG AAG CTG 144 5 

Val Val Asn His Gin Gin Gly Pro His His Arg His lie Leu Lys Leu 
15 300 305 310 

CTG CCG TCC ATG GAG GCC ACT GGG GGC GAG AAG TCC AGC ACG CCC ATC 14 9 3 

Leu Pro Ser Met Glu Ala Thr Gly Gly Glu Lys Ser Ser Thr Pro lie 

315 320 325 

20 

AAG GGC CCC AAG AGG GGA CAT CCT AGA CAG AAC CTA CAC AAG CAT TTT 1541 

Lys Gly Pro Lys Arg Gly His Pro Arg Gin Asn Leu His Lys His Phe 

330 335 340 



GAC ATC AAT GAG CAT TTG CCC TGG ATG ATT GTG CTT TTC CTG CTG CTG 158 9 

Asp lie Asn Glu His Leu Pro Trp Met lie Val Leu Phe Leu Leu Leu 
345 350 355 360 



GTG CTT GTG GTG ATT GTG GTG TGC AGT ATC CGG AAA AGC TCG AGG ACT 163 7 

30 Val Leu Val Val lie Val Val Cys Ser lie Arg Lys Ser Ser Arg Thr 

365 1 370 375 

CTG AAA AAG GGG CCC CGG CAG GAT CCC AGT GCC ATT GTG GAA AAG GCA 16 8 5 

Leu Lys Lys Gly Pro Arg Gin Asp Pro Ser Ala lie Val Glu Lys Ala 
35 380 385 390 
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GGG CTG AAG AAA TCC 
Gly Leu Lys Lys Ser 
395 

5 TAG TAC TGC AAT GGC 
Tyr Tyr Cys Asn Gly 
410 

CAA GTG GGA AGC CAG 
10 Gin Val Gly Ser Gin 
425 



ATG ACT CCA ACC CAG AAC CGG 
Met Thr Pro Thr Gin Asn Arg 
400 

CAT GGT ATC GAT ATC CTG AAG 
His Gly lie Asp lie Leu Lys 
415 420 

TGG AAA GAT ATC TAT CAG TTT 
Trp Lys Asp lie Tyr Gin Phe 
430 435 



GAG AAA TGG ATC 173 3 

Glu Lys Trp -lie 

405 

CTT GTA GCA GCC 17 31 

Leu Val Ala Ala 

CTT TGC AAT GCC 18 2 9 

Leu Cys Asn Ala 
440 



AGT GAG AGG GAG GTT GCT GCT TTC TCC AAT GGG TAC ACA GCC GAC CAC 13 7 7 

Ser Glu Arg Glu Val Ala Ala Phe Ser Asn Gly Tyr Thr Ala Asp His 
15 445 450 455 



20 



GAG CGG GCC TAC GCA GCT CTG CAG CAC TGG ACC ATC CGG GGC CCC GAG 192 5 

Glu Arg Ala Tyr Ala Ala Leu Gin His Trp Thr lie Arg Gly Pro Glu 

460 465 470 

GCC AGC CTC GCC CAG CTA ATT AGC GCC CTG CGC CAG CAC CGG AGA AAC 1973 

Ala Ser Leu Ala Gin Leu He Ser Ala Leu Arg Gin His Arg Arg Asn 
475 480 485 

25 GAT GTT GTG GAG AAG ATT CGT GGG CTG ATG GAA GAC ACC ACC CAG CTG 2 021 

Asp Val Val Glu Lys He Arg Gly Leu Met Glu Asp Thr Thr Gin Leu 
490 495 500 

GAA ACT GAC AAA CTA GCT CTC CCG ATG AGC CCC AGC CCG CTT AGC CCG 2 06 9 

30 Glu Thr Asp Lys Leu Ala Leu Pro Met Ser Pro Ser Pro Leu Ser Pro 
505 510 515 520 



AGC CCC ATC CCC AGC CCC AAC GCG AAA CTT GAG AAT TCC GCT CTC CTG 2117 
Ser Pro He Pro Ser Pro Asn Ala Lys Leu Glu Asn Ser Ala Leu Leu 
35 525 530 535 
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ACG GTG GAG CCT TCC CCA CAG GAT TTG CTA TTT AAG TGG CTT GAC AAC 216 5 

Thr Val Glu Pro Ser Pro Gin Asp Leu Leu Phe Lys Trp Leu Asp Asn 

540 545 550 



5 TGG GCC ACC AAA GAA CTT GAA CTT CAC CTT TTA GGA TTT GAG CTG TTC 2213 
Trp Ala Thr Lys Glu Leu Glu Leu His Leu Leu Gly Phe Glu Leu Phe 
555 560 565 

TGG AAC ACA TTG CTG CAC TTT GGA AAG TCA AAA TCA AGT GCC AGT GGC 2261 
10 Trp Asn Thr Leu Leu His Phe Gly Lys Ser Lys Ser Ser Ala Ser Gly 
570 575 580 

GCC CTT TCC ATA GAG AAT TTG CCC AGC TTT GCT TTA AAA GAT GTC TTG 23 0 9 

Ala Leu Ser lie Glu Asn Leu Pro Ser Phe Ala Leu Lys Asp Val Leu 
i5 585 590 595 600 



TTT TTT ATA TAC ACA T AAT C AAT AG GTCCAATCTG CTCTCAAGGC CTTGGTCCTG 2 3 64 

Phe Phe lie Tyr Thr 
605 

20 

GTGGGATTCC TTCACCAATT ACTTTAATTA AAAATGGCTG CAACTGTAAG AACCCTTGTC 24 2 4 

TGATATATTT GCAACTATGC TCCCATTTAC AAATGTACCT TCTAATGCTC AG TTG C CAG G 24 8 4 

25 TTCCAATGCA AAGGTGGCGT GGACTCCCTT TGTGTGGGTG GGGTTTGTGG GTAGTGGTGA 2 544 

AGGACCGATA T C AG AAAAAT GCCTTCAAGT GTACTAATTT ATTAATAAAC ATTAGGTGTT 2 6 04 

TGTTACTTAA AAAAAAAAAA AAAAGGGCGG CCGC 2 63 8 

30 

(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 amino acids 
35 (3) TYPE: amino acid 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

5 Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
15 10 15 

lie Ala Arg Arg Ala Thr Ala Thr Met lie Ala Gly Ser Leu Leu Leu 
20 25 30 

10 

Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser 
35 40 45 

Asn Leu lie Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val 
15 50 55 60 

Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys 
65 70 75 80 

20 Thr Asn Thr Ser Leu Arg Val Cys Ser Ser Cys Pro Val Gly Thr Phe 

85 90 95 

Thr Arg His Glu Asn Gly lie Glu Lys Cys His Asp Cys Ser Gin Pro 
100 105 110 

25 

Cys Pro Trp Pro Met lie Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp 
115 120 125 

Arg Glu Cys Thr Cys Pro Pro Gly Met Phe Gin Ser Asn Ala Thr Cys 
30 130 135 140 

Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly 
145 150 155 160 

35 Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe 

165 170 175 
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Ser Asp Val Pro Ser Ser Val Met Lys Cys Lys Ala Tyr Thr Asp Cys 
180 185 190 

Leu Ser Gin Asn Leu Val Val lie Lys Pro Gly Thr Lys Glu Thr Asp 
5 195 200 205 

Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser 
210 215 - 220 

10 Pro Gly Thr Ala lie Phe Pro Arg Pro Glu His Met Glu Thr His Glu 
225 230 235 240 

Val Pro Ser Ser Thr Tyr Val Pro Lys Gly Met Asn Ser Thr Glu Ser 
245 250 255 

15 

Asn Ser Ser Ala Ser Val Arg Pro Lys Val Leu Ser Ser lie Gin Glu 
260 265 270 

Gly Thr Val Pro Asp Asn Thr Ser Ser Ala Arg Gly Lys Glu Asp Val 
20 275 280 285 

Asn Lys Thr Leu Pro Asn Leu Gin Val Val Asn His Gin Gin Gly Pro 
290 295 300 

25 His His Arg His lie Leu Lys Leu Leu Pro Ser Met Glu Ala Thr Gly 
305 310 315 320 

Gly Glu Lys Ser Ser Thr Pro lie Lys Gly Pro Lys Arg Gly His Pro 
325 330 335 

30 

Arg Gin Asn Leu His Lys His Phe Asp lie Asn Glu His Leu Pro Trp 
340 345 350 

Met lie Val Leu Phe Leu Leu Leu Val Leu Val Val lie Val Val Cys 
35 355 360 365 
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Ser He Arg Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp 
370 375 380 

Pro Ser Ala He Val Glu Lys Ala Gly Leu Lys Lys Ser Met: Thr Pro 
5 385 390 39S 400 

Thr Gin Asn Arg Glu Lys Trp He Tyr Tyr Cys Asn Gly His Gly He 
405 410 415 

10 Asp He Leu Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp Lys Asp 
420 425 430 

He Tyr Gin Phe Leu Cys Asn Ala Ser Glu Arg Glu Val Ala Ala Phe 
435 440 445 

15 

Ser Asn Gly Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala Leu Gin 
450 455 460 

His Trp Thr He Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu He Ser 
20 465 470 475 480 

Ala Leu Arg Gin His Arg Arg Asn Asp Val Val Glu Lys He Arg Gly 
485 490 495 

25 Leu Men Glu Asp Thr Thr Gin Leu Glu Thr Asp Lys Leu Ala Leu Pro 
500 505 510 

Met Ser Pro Ser Pro Leu Ser Pro Ser Pro He Pro Ser Pro Asn Ala 
515 520 525 . 

30 

Lys Leu Glu Asn Ser Ala Leu Leu Thr Val Glu Pro Ser Pro Gin Asp 
530 535 540 

Leu Leu Phe Lys Trp Leu Asp Asn Trp Ala Thr Lys Glu Leu Glu Leu 
35 545 550 555 560 
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His Leu Leu Gly Phe Glu Leu Phe Trp Asn Thr Leu Leu His Phe Gly 
565 570 575 

Lys Ser Lys Ser Ser Ala Ser Gly Ala Leu Ser He Glu Asn Leu Pro 
5 580 585 590 

Ser Phe Ala Leu Lys Asp Val Leu Phe Phe lie Tyr Thr 
595 600 605 

10 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1815 base pairs 

( B ) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

20 (ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

25 

ATG GGG ACC TCT CCG AGC AGC AGC ACC GCC CTC GCC TCC TGC AGC CGC 4 8 

Met Gly Thr Ser Pro Ser Ser Ser Thr Ala Leu Ala Ser Cys Ser Arg 
15 10 15 

30 ATC GCC CGC CGA GCC ACA GCC ACG ATG ATC GCG GGC TCC CTT CTC CTG 96 
lie Ala Arg Arg Ala Thr Ala Thr Met lie Ala Gly Ser Leu Leu Leu 
20 25 30 

CTT GGA TTC CTT AGC ACC ACC ACA GCT CAG CCA GAA CAG AAG GCC TCG 14 4 

35 Leu Gly Phe Leu Ser Thr Thr Thr Ala Gin Pro Glu Gin Lys Ala Ser 
35 40 45 
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AAT CTC ATT GGC ACA TAC CGC CAT GTT GAC CGT GCC ACC GGC CAG GTG 192 

Asn Leu lie Gly Thr Tyr Arg His Val Asp Arg Ala Thr Gly Gin Val 

50 55 60 

5 CTA ACC TGT GAC AAG TGT CCA GCA GGA ACC TAT GTC TCT GAG CAT TGT 24 0 

Leu Thr Cys Asp Lys Cys Pro Ala Gly Thr Tyr Val Ser Glu His Cys 

65 70 75 80 

ACC AAC ACA AGC CTG CGC GTC TGC AGC AGT TGC CCT GTG GGG ACC TTT 2 83 

10 Thr Asn Thr Ser Leu Arg Val Cys Ser Ser Cys Pro Val Gly Thr Phe 

S5 90 95 

ACC AGG CAT GAG AAT GGC ATA GAG AAA TGC CAT GAC TGT AGT CAG CCA 3 36 

Thr Arg His Glu Asn Gly Xle Glu Lys Cys His Asp Cys Ser Gin Pro 

15 100 105 110 

TGC CCA TGG CCA ATG ATT GAG AAA TTA CCT TGT GCT GCC TTG ACT GAC 3 84 

Cys Pro Trp Pro Met lie Glu Lys Leu Pro Cys Ala Ala Leu Thr Asp 

115 120 125 

20 

CGA GAA TGC ACT TGC CCA CCT GGC ATG TTC CAG TCT AAC GCT ACC TGT 43 2 

Arg Glu Cys Thr Cys Pro Pro Gly. Met Phe Gin Ser Asn Ala Thr Cys 

130 135 140 

25 GCC CCC CAT ACG GTG TGT CCT GTG GGT TGG GGT GTG CGG AAG AAA GGG 48 C 

Ala Pro His Thr Val Cys Pro Val Gly Trp Gly Val Arg Lys Lys Gly 

145 150 155 160 

ACA GAG ACT GAG GAT GTG CGG TGT AAG CAG TGT GCT CGG GGT ACC TTC 52 8 

30 Thr Glu Thr Glu Asp Val Arg Cys Lys Gin Cys Ala Arg Gly Thr Phe 

165 170 175 

TCA GAT GTG CCT TCT AGT GTG ATG AAA TGC AAA GCA TAC ACA GAC TGT 5 76 

Ser Asp Val Pro Ser Ser Val Met Lys Cys Lys Ala Tyr Thr Asp Cys 

35 180 185 190 
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CTG AGT CAG AAC CTG GTG GTG ATC AAG CCG GGG ACC AAG GAG ACA GAC 624 
Leu Ser Gin Asn Leu Val Val lie Lys Pro Gly Thr Lys Glu Thr Asp 
195 200 205 

5 AAC GTC TGT GGC ACA CTC CCG TCC TTC TCC AGC TCC ACC TCA CCT TCC 6 72 

Asn Val Cys Gly Thr Leu Pro Ser Phe Ser Ser Ser Thr Ser Pro Ser 
210 215 220 

CCT GGC ACA GCC ATC TTT CCA CGC CCT GAG CAC ATG GAA ACC CAT GAA 72 0 

10 Pro Gly Thr Ala lie Phe Pro Arg Pro Glu His Met Glu Thr His Glu 
225 230 235 240 

GTC CCT TCC TCC ACT TAT GTT CCC AAA GGC ATG AAC TCA ACA GAA TCC 76 3 

Val Pro Ser Ser Thr Tyr Val Pro Lys Gly Met Asn Ser Thr Glu Ser 
15 245 250 255 

AAC TCT TCT GCC TCT GTT AGA CCA AAG GTA CTG AGT AGC ATC CAG GAA 816 

Asn Ser Ser Ala Ser Val Arg Pro Lys Val Leu Ser Ser lie Gin Glu 

260 265 270 

20 

GGG ACA GTC CCT GAC AAC ACA AGC TCA GCA AGG GGG AAG GAA GAC GTG 8 64 

Gly Thr Val Pro Asp Asn Thr Ser Ser Ala Arg Gly Lys Glu Asp Val 

275 280 285 

25 AAC AAG ACC CTC CCA AAC CTT CAG GTA GTC AAC CAC CAG CAA GGC CCC 912 
Asn Lys Thr Leu Pro Asn Leu Gin Val Val Asn His Gin Gin Gly Pro 
290 295 300 

CAC CAC AGA CAC ATC CTG AAG CTG CTG CCG TCC ATG GAG GCC ACT GGG 96 0 

30 His His Arg His lie Leu Lys Leu Leu Pro Ser Met Glu Ala Thr Gly 
305 310 315 320 

GGC GAG AAG TCC AGC ACG CCC ATC ' AAG GGC CCC AAG AGG GGA CAT CCT 1003 
Gly Glu Lys Ser Ser Thr Pro lie Lys Gly Pro Lys Arg Gly His Pro 
35 325 330 335 
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AGA CAG AAC CTA CAC AAG CAT TTT GAC ATC AAT GAG CAT TTG CCC TGG 10 56 

Arg Gin Asn Leu His Lys His Phe Asp lie Asn Glu His Leu Pro Trp 
340 345 350 



3 ATG ATT GTG CTT TTC CTG CTG CTG GTG CTT GTG GTG ATT GTG GTG TGC 1104 
Met lie Val Leu Phe Leu Leu Leu Val Leu Val Val lie Val Val Cys 
355 360 365 

AGT ATC CGG AAA AGC TCG AGG ACT CTG AAA AAG GGG CCC CGG CAG GAT 1152 
10 Ser lie Arg Lys Ser Ser Arg Thr Leu Lys Lys Gly Pro Arg Gin Asp 
370 375 380 

CCC AGT GCC ATT GTG GAA AAG GCA GGG CTG AAG AAA TCC ATG ACT CCA 12 00 

Pro Ser Ala lie Val Glu Lys Ala Gly Leu Lys Lys Ser Met Thr Pro 
15 385 390 395 400 

ACC CAG AAC CGG GAG AAA TGG ATC TAC TAC TGC AAT GGC CAT GGT ATC 124 8 

Thr Gin Asn Arg Glu Lys Trp lie Tyr Tyr Cys Asn Gly His Gly lie 
405 410 415 

20 

GAT ATC CTG AAG CTT GTA GCA GCC CAA GTG GGA AGC CAG TGG AAA GAT 12 9 6 

Asp lie Leu Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp Lys Asp 
420 425 430 

25 ATC TAT CAG TTT CTT TGC AAT GCC AGT GAG AGG GAG GTT GCT GCT TTC 13 4 4 

lie Tyr Gin Phe Leu Cys Asn Ala Ser Glu Arg Glu Val Ala Ala Phe 
435 440 445 

TCC AAT GGG TAC ACA GCC GAC CAC GAG CGG GCC TAC GCA GCT CTG CAG 13 92 

30 Ser Asn Gly Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala Leu Gin 
450 ' 455 460 

CAC TGG ACC ATC CGG GGC CCC GAG GCC AGC CTC GCC CAG CTA ATT AGC 14 4 0 

His Trp Thr lie Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu lie Ser 
35 465 470 475 480 
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GCC CTG CGC CAG CAC CGG AGA AAC GAT GTT GTG GAG AAG ATT CGT GGG 14 8 3 

Ala Leu Arg Gin His Arg Arg Asn Asp Val Val Glu Lys lie Arg Gly 
485 490 495 

5 CTG ATG GAA GAC ACC ACC CAG CTG GAA ACT GAC AAA CTA GCT CTC CCG 15 3 6 

Leu Met Glu Asp Thr Thr Gin Leu Glu Thr Asp Lys Leu Ala Leu Pro 
500 505 510 

ATG AGC CCC AGC CCG CTT AGC CCG AGC CCC ATC CCC AGC CCC AAC GCG 15 84 

10 Met Ser Pro Ser Pro Leu Ser Pro Ser Pro lie Pro Ser Pro Asn Ala 
515 520 525 

AAA CTT GAG AAT TCC GCT CTC CTG ACG GTG GAG CCT TCC CCA CAG GAT 16 32 

Lys Leu Glu Asn Ser Ala Leu Leu Thr Val Glu Pro Ser Pro Gin Asp 
15 530 535 540 

TTG CTA TTT AAG TGG CTT GAC AAC TGG GCC ACC AAA GAA CTT GAA CTT 16 8 0 

Leu Leu Phe Lys Trp Leu Asp Asn Trp Ala Thr Lys Glu Leu Glu Leu 

545 550 555 560 

20 

CAC CTT TTA GGA TTT GAG CTG TTC TGG AAC ACA TTG CTG CAC TTT GGA 172 8 

His Leu Leu Gly Phe Glu Leu Phe Trp Asn Thr Leu Leu His Phe Gly 

565 570 575 

25 AAG TCA AAA TCA AGT GCC AGT GGC GCC CTT TCC ATA GAG AAT TTG CCC 17 76 

Lys Ser Lys Ser Ser Ala Ser Gly Ala Leu Ser lie Glu Asn Leu Pro 
580 585 590 

AGC TTT GCT TTA AAA GAT GTC TTG TTT TTT ATA TAG ACA 1815 
30 Ser Phe Ala Leu Lys Asp Val Leu Phe Phe lie Tyr Thr 
595 • 600 605 
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(2) INFORMATION FOR SEQ ID NO : 2 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
10 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 5 : 

Pro Ala Thr Leu Tyr Ala Val Val Glu Asn Val Pro Pro Leu Arg Trp 
15 1 5 .10 15 

Lys Glu Phe Val Arg Arg Leu Gly Leu Ser Asp His Glu lie Asp Arg 
20 25 30 

20 Leu Glu Leu Gin Asn Gly Arg Cys Leu Arg Glu Ala Gin Tyr Ser Met 

35 40 45 

Leu Ala Thr Trp Arg Arg Arg Thr Pro Arg Arg Glu Ala Thr Leu Glu 
50 55 60 

25 

Leu Leu Gly Arg Val Leu Arg Asp Met Asp Leu Leu Gly Cys Leu Glu 
65 70 75 80 

Asp lie Glu Glu Ala Leu 
30 85 

{2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 8 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ser Lys Tyr He Thr Thr He Ala Gly Val Met Thr Leu Ser Gin Val 
1 .5 10 15 

Lys Gly Phe Val Arg Lys Asn Gly Val Asn GIu Ala Lys He Asp Glu 
20 25 30 

He Lys Asn Asp Asn Val Gin Asp Thr Ala Glu Gin Lys Val Gin Leu 
35 40 45 

Leu Arg Asn Trp His Gin Leu His Gly Lys Lys Glu Ala Tyr Asp Thr 
50 55 60 

Leu He Lys Asp Leu Lys Lys Ala Asn Leu Cys Thr Leu Ala Glu Lys 
65 70 75 8 0. 

He Gin Thr He He 
85 

(2) INFORMATION FOR SEQ ID NO : 2 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 7 : 
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Ser Leu Lys Asp Gin Gin Thr Phe Ala Arg Ser Val Gly Leu Lys Trp 
15 10 15 

Arg Lys Val Gly Arg Ser Leu Gin Arg Gly Cys Arg Ala Leu Arg Asp 
20 25 30 

Pro Ala Leu Asp Ser Leu Ala Tyr Giu Tyr Glu Arg Glu Gly Leu Tyr 
35 40 45 

Glu Gin Ala Phe Gin Leu Leu Arg Arg Phe Val Gin Ala Glu Gly Arg 
50 55 60 

Arg Ala Thr Leu Gin Arg Leu Val Glu Ala Leu Glu Glu Asn Glu Leu 
6 5 70 75 80 

Thr Ser Leu Ala Glu Asp Leu Leu Gly Leu Thr 
85 90 

(2) INFORMATION FOR SEQ ID NO : 2 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Leu Cys Ala Ala Phe Asn Val lie Cys Asp Asn Val Gly Lys Asp Trp 
15 10 15 

Arg Arg Leu Ala Arg Gin Leu Lys Val Ser Asp Thr Lys lie Asp Ser 
20 25 30 
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Ile Glu Asp Arg Tyr Pro Arg Asn Leu Thr Glu Arg Val Arg Glu Ser 
35 40 45 

5 Leu Arg lie Trp Lys Asn Thr Glu Lys Glu Asn Ala Thr Val Ala His 

50 55 60 

Leu Val Gly Ala Leu Arg Ser Cys Gin Met Asn Leu Val Ala Asp Leu 
65 70 75 80 

10 

Val Gin Glu Val Gin 
85 

(2) INFORMATION FOR SEQ ID NO : 2 9 : 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 

Thr Asp Lys His Leu Asp Pro lie Arg Glu Asn Leu Gly Lys His Trp 
15 10 15 

30 Lys Asn Cys Ala Arg Lys Leu Gly Phe Thr Gin Ser Gin lie Asp Glu 

20 25 30 

lie Asp His Asp Tyr Glu Arg Asp Gly Leu Lys Glu Lys Val Tyr Gin 
35 40 45 



3D 



Met Leu Gin Lys Trp Val Met Arg Glu Gly He Lys Gly Ala Thr Val 
50 55 60 
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Gly Lys Leu Ala Gin Ala Leu His Gin Cys Ser Arg lie Asp Leu Leu 
65 70 75 80 

5 Ser Ser Leu lie Tyr Val Ser 

85 

(2) INFORMATION FOR SEQ ID NO : 3 0 : 

10 {i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY ; linear 



15 (ii) MOLECULE TYPE: peptide 



20 



35 



FRAGMENT TYPE : incernal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 



Gly lie Asp lie Leu Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp 
15 10 15 

Lys Asp lie Tyr Gin Phe Leu Cys Asn Ala Ser Glu Arg Giu Val Ala 

25 20 25 30 

Ala Phe Ser Asn Gly Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala 
35 40 45 

30 Leu Gin His Trp Thr lie Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu 

50 55 60 

lie Ser Ala Leu Arg Gin His Arg Arg Asn Asp Val Val Glu Lys lie 
65 70 75 80 



Arg Gly Leu Met 
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10 



(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 
{ B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE : internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 

15 Gly lie Asp lie Leu Lys Leu Val Ala Ala Gin Val Gly Ser Gin Trp 

15 10 15 

Lys Asp He Tyr Gin Phe Leu Cys Asn Ala Ser Glu Arg Glu Val Ala 
20 25 30 

20 

Ala Phe Ser Asn Gly Tyr Thr Ala Asp His Glu Arg Ala Tyr Ala Ala 
35 40 45 

Leu Gin His Trp Thr He Arg Gly Pro Glu Ala Ser Leu Ala Gin Leu 
25 50 55 60 

He Ser Ala Leu Arg Gin His Arg Arg Asn Asp Val Val Glu Lys He 
65 70 75 80 



30 Arg Gly Leu Met 
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